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TITLE 

PLANT CELLULOSE SYNTHASES 
This application claims the benefit of U.S. Provisional Application No. 60/092,844. 
filed July 14, 1998. 
5 FIELD OF THF INVENTION 

This invention is in the field of plant molecular biology. More specifically, this 
invention pertains to nucleic acid fragments encoding cellulose biosynthetic enzymes in 
plants and seeds. 

BACKGROUND OF THE INVENTION 
10 Cellulose is a major component of plant fiber, e.g. cotton fiber. Cellulose is 

composed of crystalline beta-1.4-glucan microfibrils (see World Patent Publication 
No. WO 98/00549). These microfibrils are strong and can resist enzymatic and mechanical 
degradation and are important in determining nutritional quality of animal and human 
foodstuffs. Hence, modification of the biosynthetic pathway responsible for cellulose 
15 synthesis through modification of cellulose synthase activity could potentially alter fiber 
quantity, either by producing more or less fiber in a particular plant species or in a specific 
organ or tissue of a particular plant. Modification of cellulose synthase activity could 
increase the value of the fiber to the end-user and may improve the structural integrity of the 
plant cell wall. Lastly, because cellulose is a major cell wall component, inhibition of 
20 cellulose synthesis would probably be lethal. Thus, cellulose synthase may serve as the 
target for a novel class of herbicides. Plant cellulose synthase genes, homologs of the 
bacterial eel A genes encoding the catalytic subunit of cellulose synthase, have been reported 
from cotton. Arabidopsis. rice and alfala (World Patent Publication Nos. WO 98/00549 and 
WO 98/18949). 

25 There is a great deal of interest in identifying the genes that encode proteins involved 

in cellulose synthesis. These genes may be used in plant cells to control the synthesis of 
cellulose. Accordingly, the availability of nucleic acid sequences encoding all or a portion 
of a cellulose synthase would facilitate studies to better understand cellulose synthesis in 
plants and provide genetic tools to alter cellulose production. 

30 SUMMARY OF THE INVENTION 

The instant invention relates to isolated nucleic acid fragments encoding cellulose 
biosynthesis enzymes. Specifically, this invention concerns an isolated nucleic acid 
fragment encoding a cellulose synthase and an isolated nucleic acid fragment that is 
substantially similar to an isolated nucleic acid fragment encoding a cellulose synthase. In 

35 addition, this invention relates to a nucleic acid fragment that is complementary to the 

nucleic acid fragment encoding cellulose synthase. An additional embodiment of the instant 
invention pertains to a polypeptide encoding all or a substantial portion of a cellulose 
synthase. 
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In another embodiment, the instant invention relates to a chimeric gene encoding a 
cellulose synthase, or to a chimeric gene that comprises a nucleic acid fragment that is 
complementary to a nucleic acid fragment encoding a cellulose synthase, operably linked to 
suitable regulatory sequences, wherein expression of the chimeric gene results in production 
5 of levels of the encoded protein in a transformed host cell that is altered (i.e., increased or 
decreased) from the level produced in an untransformed host cell. 

In a further embodiment, the instant invention concerns a transformed host cell 
comprising in its genome a chimeric gene encoding a cellulose synthase, operably linked to 
suitable regulatory sequences. Expression of the chimeric gene results in production of 
10 altered levels of the encoded protein in the transformed host cell. The transformed host cell 
can be of eukaryotic or prokaryotic origin, and include cells derived from higher plants and 
microorganisms. The invention also includes transformed plants that arise from transformed 
host cells of higher plants, and seeds derived from such transformed plants. 

An additional embodiment of the instant invention concerns a method of altering the 
15 level of expression of a cellulose synthase in a transformed host cell comprising: 
a) transforming a host cell with a chimeric gene comprising a nucleic acid fragment 
encoding a cellulose synthase; and b) growing the transformed host cell under conditions 
that are suitable for expression of the chimeric gene wherein expression of the chimeric gene 
results in production of altered levels of cellulose synthase in the transformed host cell. 
20 An addition embodiment of the instant invention concerns a method for obtaining a 

nucleic acid fragment encoding all or a substantial portion of an amino acid sequence 
encoding a cellulose synthase. 

A further embodiment of the instant invention is a method for evaluating at least one 
compound for its ability to inhibit the activity of a cellulose synthase, the method comprising 
25 the steps of: (a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a cellulose synthase, operably linked to suitable regulatory sequences; 
(b) growing the transformed host cell under conditions that are suitable for expression of the 
chimeric gene wherein expression of the chimeric gene results in production of cellulose 
synthase in the transformed host cell; (c) optionally purifying the cellulose synthase 
30 expressed by the transformed host cell; (d) treating the cellulose synthase with a compound 
to be tested; and (e) comparing the activity of the cellulose synthase that has been treated 
with a test compound to the activity of an untreated cellulose synthase, thereby selecting 
compounds with potential for inhibitory activity. 

BRIEF DESCRIPTION OF THE 
35 DRAWINGS AND SEQUENCE DESCRIPTIONS 

The invention can be more fully understood from the following detailed description 
and the accompanying drawings and Sequence Listing which form a part of this application. 
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10 



Figure 1 shows a comparison of the amino acid sequences set forth in SEQ ID NOs:2, 
4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thallana sequences (SEQ ID NOs:23 
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 3 13561 1)) and 
Gossypium hirsutum sequences (SEQ ID NOs:25 (gi 1706958) and 28 (gi 5081779)). 

Table 1 lists the polypeptides that are described herein, the designation of the cDNA 
clones that comprise the nucleic acid fragments encoding polypeptides representing all or a 
substantial portion of these polypeptides, and the corresponding identifier (SEQ ID NO:) as 
used in the attached Sequence Listing. The sequence descriptions and Sequence Listing 
attached hereto comply with the rules governing nucleotide and/or amino acid sequence 
disclosures in patent applications as set forth in 37 C.F.R. §1.821-1.825. 



15 





TABLE 1 








Cellulose Biosynthetic Enzymes 




Protein 




SEQ ID NO: 


1 1/">T\0 noctnnntiAn 

^lone i-Jcsignauon 


\i\ ucieouae ) 


(Amino Acid) 


Cellulose Synthase 


bshl.pk0002.f6 


i 


2 


Cellulose Synthase 


Contig composed of: 
ccoln.pk0005.g3 
cdt2c.pk002.gl 
cdt2c.pk002.116 
csclc.pk002.il 
p0031.ccmar05rb 
p0110.cgsma57r 


3 


4 


Cellulose Synthase 


crln.pk0135.el0 


5 


6 


Cellulose Synthase 


p0097.cqradl7rc 


7 


8 


Cellulose Synthase 


p0122.ckamh70rc 


9 


10 


Cellulose Synthase 


rlr24.pk0073.gl 


11 


12 


Cellulose Synthase 


sdp2c.pk005.o22 


13 


14 


Cellulose Synthase 


ses8w.pk0028.D 


15 


16 


Cellulose Synthase 


ssl.pk0036.cl0 


17 


18 


Cellulose Synthase 


Contig composed of: 
wll.pk0009.c9 
wrl.pk0160.dll 
wreln.pk0043.f9 
wreln.pk0043.h8 
wreln.pk0131.gl0 


19 


20 


Cellulose Synthase 


wlln.pk0044.bl 


21 


22 



The Sequence Listing contains the one letter code for nucleotide sequence characters 
and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB 
standards described in Nucleic Acids Research 75:3021-3030 (1985) and in the Biochemical 
Journal 219 (No. 2;.345-373 (1984) which are herein incorporated by reference. The 
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symbols and format used for nucleotide and amino acid sequence data comply with the rules 
set forth in 37 CF.R. §1.822. 

DETAILED DESCRIPTION OF THE INVENTION 

In the context of this disclosure, a number of terms shall be utilized. As used herein, a 
"nucleic acid fragment" is a polymer of RNA or DNA that is single- or double-stranded, 
optionally containing synthetic, non-natural or altered nucleotide bases. A nucleic acid 
fragment in the form of a polymer of DNA may be comprised of one or more segments of 
cDNA, genomic DNA or synthetic DNA. 

As used herein, "contig" refers to a nucleotide sequence that is assembled from two or 
more constituent nucleotide sequences that share common or overlapping regions of 
sequence homology. For example, the nucleotide sequences of two or more nucleic acid 
fragments can be compared and aligned in order to identify common or overlapping 
sequences. Where common or overlapping sequences exist between two or more nucleic 
acid fragments, the sequences (and thus their corresponding nucleic acid fragments) can be 
assembled into a single contiguous nucleotide sequence. 

As used herein, "substantially similar" refers to nucleic acid fragments wherein 
changes in one or more nucleotide bases results in substitution of one or more amino acids, 
but do not affect the functional properties of the polypeptide encoded by the nucleotide 
sequence. "Substantially similar" also refers to nucleic acid fragments wherein changes in 
one or more nucleotide bases does not affect the ability of the nucleic acid fragment to 
mediate alteration of gene expression by gene silencing through for example antisense or co- 
suppression technology. "Substantially similar" also refers to modifications of the nucleic 
acid fragments of the instant invention such as deletion or insertion of one or more 
nucleotides that do not substantially affect the functional properties of the resulting 
transcript vis-a-vis the ability to mediate gene silencing or alteration of the functional 
properties of the resulting protein molecule. It is therefore understood that the invention 
encompasses more than the specific exemplary nucleotide or amino acid sequences and 
includes functional equivalents thereof. 

For example, it is well known in the art that antisense suppression and co-suppression 
of gene expression may be accomplished using nucleic acid fragments representing less than 
the entire coding region of a gene, and by nucleic acid fragments that do not share 100% 
sequence identity with the gene to be suppressed. Moreover, alterations in a nucleic acid 
fragment which result in the production of a chemically equivalent amino acid at a given 
site, but do not effect the functional properties of the encoded polypeptide, are well known in 
the art. Thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 
substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more 
hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes which result 
in substitution of one negatively charged residue for another, such as aspartic acid for 

4 
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glutamic acid, or one positively charged residue for another, such as lysine for arginine, can 
also be expected to produce a functionally equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the polypeptide molecule 
would also not be expected to alter the activity of the polypeptide. Each of the proposed 
modifications is well within the routine skill in the art, as is determination of retention of 
biological activity of the encoded products. 

Moreover, substantially similar nucleic acid fragments may also be characterized by 
their ability to hybridize, under stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the 
nucleic acid fragments disclosed herein. 

Substantially similar nucleic acid fragments of the instant invention may also be 
characterized by the percent identity of the amino acid sequences that they encode to the 
amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Preferred are those nucleic acid fragments whose nucleotide 
sequences encode amino acid sequences that are 80% identical to the amino acid sequences 
reported herein. More preferred nucleic acid fragments encode amino acid sequences that 
are 90% identical to the amino acid sequences reported herein. Most preferred are nucleic 
acid fragments that encode amino acid sequences that are 95% identical to the amino acid 
sequences reported herein. Sequence alignments and percent identity calculations were 
performed using the Megalign program of the LAS ARGENE bioinformatics computing suite 
(DNASTAR Inc., Madison, WI). Multiple alignment of the sequences was performed using 
the Clustal method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the 
default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default 
parameters for pairwise alignments using the Clustal method were KTUPLE 1, GAP 
PENALTY=3, WINDOWS and DIAGONALS SAVED=5. 

A "substantial portion" of an amino acid or nucleotide sequence comprises an amino 
acid or a nucleotide sequence that is sufficient to afford putative identification of the protein 
or gene that the amino acid or nucleotide sequence comprises. Amino acid and nucleotide 
sequences can be evaluated either manually by one skilled in the art, or by using computer- 
based sequence comparison and identification tools that employ algorithms such as BLAST 
(Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol. 275:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/). In general, a sequence often or more contiguous 
amino acids or thirty or more contiguous nucleotides is necessary in order to putatively 
identify a polypeptide or nucleic acid sequence as homologous to a known protein or gene. 
Moreover, with respect to nucleotide sequences, gene-specific oligonucleotide probes 
comprising 30 or more contiguous nucleotides may be used in sequence-dependent methods 
of gene identification (e.g., Southern hybridization) and isolation (e.g., in situ hybridization 
of bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or 
more nucleotides may be used as amplification primers in PCR in order to obtain a particular 
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nucleic acid fragment comprising the primers. Accordingly, a "substantial portion" of a 
nucleotide sequence comprises a nucleotide sequence that will afford specific identification 
and/or isolation of a nucleic acid fragment comprising the sequence. The instant 
specification teaches amino acid and nucleotide sequences encoding polypeptides that 
5 comprise one or more particular plant proteins. The skilled artisan, having the benefit of the 
sequences as reported herein, may now use all or a substantial portion of the disclosed 
sequences for purposes known to those skilled in this art. Accordingly, the instant invention 
comprises the complete sequences as reported in the accompanying Sequence Listing, as 
well as substantial portions of those sequences as defined above. 
10 u Codon degeneracy" refers to divergence in the genetic code permitting variation of 

the nucleotide sequence without effecting the amino acid sequence of an encoded 
polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 
comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited 
15 by a specific host cell in usage of nucleotide codons to specify a given amino acid. 

Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, 
it is desirable to design the nucleic acid fragment such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building 
20 blocks that are chemically synthesized using procedures known to those skilled in the art. 
These building blocks are ligated and annealed to form larger nucleic acid fragments which 
may then be enzymatically assembled to construct the entire desired nucleic acid fragment. 
"Chemically synthesized", as related to nucleic acid fragment, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic acid fragments 
25 may be accomplished using well established procedures, or automated chemical synthesis 
can be performed using one of a number of commercially available machines. Accordingly, 
the nucleic acid fragments can be tailored for optimal gene expression based on optimization 
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan 
appreciates the likelihood of successful gene expression if codon usage is biased towards 
30 those codons favored by the host. Determination of preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, including 
regulatory sequences preceding (5' non-coding sequences) and following (3* non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
35 own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 
Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 
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from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 
that is introduced into the host organism by gene transfer. Foreign genes can comprise 
native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. 

"Coding sequence" refers to a nucleotide sequence that codes for a specific amino acid 
sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non- 
coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or translation of the 
associated coding sequence. Regulatory sequences may include promoters, translation 
leader sequences, introns, and polyadenylation recognition sequences. 

"Promoter" refers to a nucleotide sequence capable of controlling the expression of a 
coding sequence or functional RNA. In general, a coding sequence is located 3* to a 
promoter sequence. The promoter sequence consists of proximal and more distal upstream 
elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a 
nucleotide sequence which can stimulate promoter activity and may be an innate element of 
the promoter or a heterologous element inserted to enhance the level or tissue-specificity of 
a promoter. Promoters may be derived in their entirety from a native gene, or be composed 
of different elements derived from different promoters found in nature, or even comprise 
synthetic nucleotide segments. It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental conditions. 
Promoters which cause a nucleic acid fragment to be expressed in most cell types at most 
times are commonly referred to as "constitutive promoters". New promoters of various 
types useful in plant cells are constantly being discovered; numerous examples may be 
found in the compilation by Okamuro and Goldberg (1989) Biochemistry of Plants 75:1-82. 
It is further recognized that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, nucleic acid fragments of different lengths may have 
identical promoter activity. 

The "translation leader sequence" refers to a nucleotide sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence is 
present in the folly processed mRNA upstream of the translation start sequence. The 
translation leader sequence may affect processing of the primary transcript to mRNA, 
mRNA stability or translation efficiency. Examples of translation leader sequences have 
been described (Turner and Foster (1995) Molecular Biotechnology 3:225). 

The "3" non-coding sequences" refer to nucleotide sequences located downstream of a 
coding sequence and include polyadenylation recognition sequences and other sequences 
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encoding regulatory signals capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid 
tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell 7:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or it may be a RNA 
sequence derived from posttranscriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that is 
without introns and that can be translated into polypeptide by the cell. "cDNA" refers to a 
double-stranded DNA that is complementary to and derived from mRNA. "Sense" RNA 
refers to an RNA transcript that includes the mRNA and so can be translated into a 
polypeptide by the cell. "Antisense RNA" refers to an RNA transcript that is 
complementary to all or part of a target primary transcript or mRNA and that blocks the 
expression of a target gene (see U.S. Patent No. 5,107,065, incorporated herein by 
reference). The complementarity of an antisense RNA may be with any part of the specific 
nucleotide sequence, i.e., at the 5' non-coding sequence, 3* non-coding sequence, introns, or 
the coding sequence. "Functional RNA" refers to sense RNA, antisense RNA, ribozyme 
RNA, or other RNA that may not be translated but yet has an effect on cellular processes. 

The term "operably linked" refers to the association of two or more nucleic acid 
fragments on a single nucleic acid fragment so that the function of one is affected by the 
other. For example, a promoter is operably linked with a coding sequence when it is capable 
of affecting the expression of that coding sequence (i.e., that the coding sequence is under 
the transcriptional control of the promoter). Coding sequences can be operably linked to 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acid fragment of 
the invention. Expression may also refer to translation of mRNA into a polypeptide. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of 
suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production in normal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RNA 
transcripts capable of suppressing the expression of identical or substantially similar foreign 
or endogenous genes (U.S. Patent No. 5,231,020. incorporated herein by reference). 

"Altered levels" refers to the production of gene product(s) in transgenic organisms in 
amounts or proportions that differ from that of normal or non-transformed organisms. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., one from 
which any pre- or propeptides present in the primary translation product have been removed. 
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"Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to intracellular 
localization signals. 

A "chloroplast transit peptide" is an amino acid sequence which is translated in 
5 conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an 
amino acid sequence which is translated in conjunction with a protein and directs the protein 
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53). 
10 If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be 
added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) 
may be added. If the protein is to be directed to the nucleus, any signal peptide present 
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant 
Phys. 700:1627-1632). 

15 "Transformation" refers to the transfer of a nucleic acid fragment into the genome of a 

host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. Examples of 
methods of plant transformation include Agrobacterium-mediated transformation (De Blaere 
et al. (1987) Meth. Enzymol. 143:277) and particle-accelerated or "gene gun" transformation 
20 technology (Klein et al. (1 987) Nature (London) 527:70-73; U.S. Patent No. 4,945,050, 
incorporated herein by reference). 

Standard recombinant DNA and molecular cloning techniques used herein are well 
known in the art and are described more fully in Sambrook et al. Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 
25 (hereinafter "Maniatis"). 

Nucleic acid fragments encoding at least a portion of a cellulose synthase enzyme have 
been isolated and identified by comparison of random plant cDNA sequences to public 
databases containing nucleotide and protein sequences using the BLAST algorithms well 
known to those skilled in the art. The nucleic acid fragments of the instant invention may be 
30 used to isolate cDNAs and genes encoding homologous proteins from the same or other 
plant species. Isolation of homologous genes using sequence-dependent protocols is well 
known in the art. Examples of sequence-dependent protocols include, but are not limited to, 
methods of nucleic acid hybridization, and methods of DNA and RNA amplification as 
exemplified by various uses of nucleic acid amplification technologies (e.g., polymerase 
35 chain reaction, ligase chain reaction). 

For example, genes encoding other cellulose synthase enzymes, either as cDNAs or 
genomic DNAs, could be isolated directly by using all or a portion of the instant nucleic acid 
fragments as DNA hybridization probes to screen libraries from any desired plant employing 
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methodology well known to those skilled in the art. Specific oligonucleotide probes based 
upon the instant nucleic acid sequences can be designed and synthesized by methods known 
in the art (Maniatis). Moreover, the entire sequences can be used directly to synthesize 
DNA probes by methods known to the skilled artisan such as random primer DNA labeling, 
nick translation, or end-labeling techniques, or RNA probes using available in vitro 
transcription systems. In addition, specific primers can be designed and used to amplify a 
part or all of the instant sequences. The resulting amplification products can be labeled 
directly during amplification reactions or labeled after amplification reactions, and used as 
probes to isolate full length cDNA or genomic fragments under conditions of appropriate 
stringency. 

In addition, two short segments of the instant nucleic acid fragments may be used in 
polymerase chain reaction protocols to amplify longer nucleic acid fragments encoding 
homologous genes from DNA or RNA. The polymerase chain reaction may also be 
performed on a library of cloned nucleic acid fragments wherein the sequence of one primer 
is derived from the instant nucleic acid fragments, and the sequence of the other primer takes 
advantage of the presence of the polyadenylic acid tracts to the 3' end of the mRNA 
precursor encoding plant genes. Alternatively, the second primer sequence may be based 
upon sequences derived from the cloning vector. For example, the skilled artisan can follow 
the RACE protocol (Frohman et al. (1988) Proc. Natl. Acad. Sci. USA 5.5:8998) to generate 
cDNAs by using PCR to amplify copies of the region between a single point in the 
transcript and the 3' or 5' end. Primers oriented in the 3' and 5' directions can be designed 
from the instant sequences. Using commercially available 3' RACE or 5' RACE systems 
(BRL), specific 3' or 5' cDNA fragments can be isolated (Ohara et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:5673; Loh et al. (1989) Science 243:217). Products generated by the 3' 
and 5' RACE procedures can be combined to generate full-length cDNAs (Frohman and 
Martin (1989) Techniques 7:165). 

Availability of the instant nucleotide and deduced amino acid sequences facilitates 
immunological screening of cDNA expression libraries. Synthetic peptides representing 
portions of the instant amino acid sequences may be synthesized. These peptides can be 
used to immunize animals to produce polyclonal or monoclonal antibodies with specificity 
for peptides or proteins comprising the amino acid sequences. These antibodies can be then 
be used to screen cDNA expression libraries to isolate full-length cDNA clones of interest 
(Lerner (19S4) Adv. Immunol. 36:1; Maniatis). 

The nucleic acid fragments of the instant invention may be used to create transgenic 
plants in which the disclosed polypeptides are present at higher or lower levels than normal 
or in cell types or developmental stages in which they are not normally found. This would 
have the effect of altering the level of cellulose synthase in those cells. 
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Overexpression of the proteins of the instant invention may be accomplished by first 
constructing a chimeric gene in which the coding region is operably linked to a promoter 
capable of directing expression of a gene in the desired tissues at the desired stage of 
development. For reasons of convenience, the chimeric gene may comprise promoter 
sequences and translation leader sequences derived from the same genes. 3' Non-coding 
sequences encoding transcription termination signals may also be provided. The instant 
chimeric gene may also comprise one or more introns in order to facilitate gene expression. 

Plasmid vectors comprising the instant chimeric gene can then constructed. The 
choice of plasmid vector is dependent upon the method that will be used to transform host 
plants. The skilled artisan is well aware of the genetic elements that must be present on the 
plasmid vector in order to successfully transform, select and propagate host cells containing 
the chimeric gene. The skilled artisan will also recognize that different independent 
transformation events will result in different levels and patterns of expression (Jones et al. 
{mS)EMBOJ. 4:2411-2418; De Almeida et al. (1989) Mol. Gen. Genetics 21 5:78-86), and 
thus that multiple events must be screened in order to obtain lines displaying the desired' 
expression level and pattern. Such screening may be accomplished by Southern analysis of 
DNA, Northern analysis of mRNA expression, Western analysis of protein expression, or 
phenotypic analysis. 

For some applications it may be useful to direct the instant polypeptides to different 
cellular compartments, or to facilitate its secretion from the cell. It is thus envisioned that 
the chimeric gene described above may be further supplemented by altering the coding 
sequence to encode the instant polypeptides with appropriate intracellular targeting 
sequences such as transit sequences (Keegstra (1989) Cell 56:247-253), signal sequences or 
sequences encoding endoplasmic reticulum localization (Chrispeels (1991) Ann. Rev. Plant 
Phys. Plant Mol. Biol. 42:21-53), or nuclear localization signals (Raikhel (1992) Plant 
Phys. 1 00:1627-1 632) added and/or with targeting sequences that are already present 
removed. While the references cited give examples of each of these, the list is not 
exhaustive and more targeting signals of utility may be discovered in the future. 

It may also be desirable to reduce or eliminate expression of genes encoding the 
instant polypeptides in plants for some applications. In order to accomplish this, a chimeric 
gene designed for co-suppression of the instant polypeptide can be constructed by linking a 
gene or gene fragment encoding that polypeptide to plant promoter sequences. 
Alternatively, a chimeric gene designed to express antisense RNA for all or part of the 
instant nucleic acid fragment can be constructed by linking the gene or gene fragment in 
reverse orientation to plant promoter sequences. Either the co-suppression or antisense 
chimeric genes could be introduced into plants via transformation wherein expression of the 
corresponding endogenous genes are reduced or eliminated. 
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Molecular genetic solutions to the generation of plants with altered gene expression 
have a decided advantage over more traditional plant breeding approaches. Changes in plant 
phenotypes can be produced by specifically inhibiting expression of one or more genes by 
antisense inhibition or cosuppression (U. S. Patent Nos. 5,190,931, 5,107,065 and 
5,283,323). An antisense or cosuppression construct would act as a dominant negative 
regulator of gene activity. While conventional mutations can yield negative regulation of 
gene activity these effects are most likely recessive. The dominant negative regulation 
available with a transgenic approach may be advantageous from a breeding perspective. In 
addition, the ability to restrict the expression of specific phenotype to the reproductive 
tissues of the plant by the use of tissue specific promoters may confer agronomic 
advantages relative to conventional mutations which may have an effect in all tissues in 
which a mutant gene is ordinarily expressed. 

The person skilled in the art will know that special considerations are associated with 
the use of antisense or cosuppresion technologies in order to reduce expression of particular 
genes. For example, the proper level of expression of sense or antisense genes may require 
the use of different chimeric genes utilizing different regulatory elements known to the 
skilled artisan. Once transgenic plants are obtained by one of the methods described above, 
it will be necessary to screen individual transgenics for those that most effectively display 
the desired phenotype. Accordingly, the skilled artisan will develop methods for screening 
large numbers of transformants. The nature of these screens will generally be chosen on 
practical grounds, and is not an inherent part of the invention. For example, one can screen 
by looking for changes in gene expression by using antibodies specific for the protein 
encoded by the gene being suppressed, or one could establish assays that specifically 
measure enzyme activity. A preferred method will be one which allows large numbers of 
samples to be processed rapidly, since it will be expected that a large number of 
transformants will be negative for the desired phenotype. 

The instant polypeptides (or portions thereof) may be produced in heterologous host 
cells, particularly in the cells of microbial hosts, and can be used to prepare antibodies to the 
these proteins by methods well known to those skilled in the art. The antibodies are useful 
for detecting the polypeptides of the instant invention in situ in cells or in vitro in cell 
extracts. Preferred heterologous host cells for production of the instant polypeptides are 
microbial hosts. Microbial expression systems and expression vectors containing regulatory 
sequences that direct high level expression of foreign proteins are well known to those 
skilled in the art. Any of these could be used to construct a chimeric gene for production of 
the instant polypeptides. This chimeric gene could then be introduced into appropriate 
microorganisms via transformation to provide high level expression of the encoded cellulose 
synthase. An example of a vector for high level expression of the instant polypeptides in a 
bacterial host is provided (Example 6). 
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Additionally, the instant polypeptides can be used as a targets to facilitate design 
and/or identification of inhibitors of those enzymes that may be useful as herbicides. This is 
desirable because the polypeptides described herein catalyze a step in the synthesis of 
cellulose. Accordingly, inhibition of the activity of one or more of the enzymes described 
herein could lead to inhibition plant growth. Thus, the instant polypeptides could be 
appropriate for new herbicide discovery and design. 

All or a substantial portion of the nucleic acid fragments of the instant invention may 
also be used as probes for genetically and physically mapping the genes that they are a part 
of, and as markers for traits linked to those genes. Such information may be useful in plant 
breeding in order to develop lines with desired phenotypes. For example, the instant nucleic 
acid fragments may be used as restriction fragment length polymorphism (RFLP) markers. 
Southern blots (Maniatis) of restriction-digested plant genomic DNA may be probed with 
the nucleic acid fragments of the instant invention. The resulting banding patterns may then 
be subjected to genetic analyses using computer programs such as MapMaker (Lander et al. 
(1987) Genomics 7:174-181) in order to construct a genetic map. In addition, the nucleic 
acid fragments of the instant invention may be used to probe Southern blots containing 
restriction endonuclease-treated genomic DNAs of a set of individuals representing parent 
and progeny of a defined genetic cross. Segregation of the DNA polymorphisms is noted 
and used to calculate the position of the instant nucleic acid sequence in the genetic map 
previously obtained using this population (Botstein et al. (1980) Am. J. Hum. Genet. 
32:314-331). 

The production and use of plant gene-derived probes for use in genetic mapping is 
described in Bernatzky and Tanksley (1986) Plant Mol Biol Reporter 4(1) ;37-41. 
Numerous publications describe genetic mapping of specific cDNA clones using the 
methodology outlined above or variations thereof. For example, F2 intercross populations, 
backcross populations, randomly mated populations, near isogenic lines, and other sets of 
individuals may be used for mapping. Such methodologies are well known to those skilled 
in the art. 

Nucleic acid probes derived from the instant nucleic acid sequences may also be used 
for physical mapping (i.e., placement of sequences on physical maps; see Hoheisel et al. In: 
Nonmammalian Genomic Analysis: A Practical Guide, Academic press 1996, pp. 319-346, 
and references cited therein). 

In another embodiment, nucleic acid probes derived from the instant nucleic acid 
sequences may be used in direct fluorescence in situ hybridization (FISH) mapping (Trask 
(1991) Trends Genet. 7:149-154). Although current methods of FISH mapping favor use of 
large clones (several to several hundred KB; see Laan et al. (1995) Genome Research 
5:13-20), improvements in sensitivity may allow performance of FISH mapping using 
shorter probes. 
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A variety of nucleic acid amplification-based methods of genetic and physical 
mapping may be carried out using the instant nucleic acid sequences. Examples include 
allele-specific amplification (Kazazian (1989) J. Lab. Clin. Med. 1 1 4(2):95-96), 
polymorphism of PCR-amplified fragments (CAPS; Sheffield et al. (1993) Genomics 
75:325-332), allele-specific ligation (Landegren et al. (1988) Science 241: 1077- 1080), 
nucleotide extension reactions (Sokolov (1990) Nucleic Acid Res. 75:3671 ), Radiation 
Hybrid Mapping (Walter et al. (1997) Nature Genetics 7:22-28) and Happy Mapping (Dear 
and Cook (1989) Nucleic Acid Res. 77:6795-6807). For these methods, the sequence of a 
nucleic acid fragment is used to design and produce primer pairs for use in the amplification 
reaction or in primer extension reactions. The design of such primers is well known to those 
skilled in the art. In methods employing PCR-based genetic mapping, it may be necessary 
to identify DNA sequence differences between the parents of the mapping cross in the 
region corresponding to the instant nucleic acid sequence. This, however, is generally not 
necessary for mapping methods. 

Loss of function mutant phenotypes may be identified for the instant cDNA clones 
either by targeted gene disruption protocols or by identifying specific mutants for these 
genes contained in a maize population carrying mutations in all possible genes (Ballinger 
and Benzer (1 989) Proc. Natl. Acad. Sci USA 86:9402; Koes et al. (1 995) Proc. Natl. Acad. 
Sci USA 92.SU9; Bensen et al. (1995) Plant Cell 7:75). The latter approach may be 
accomplished in two ways. First, short segments of the instant nucleic acid fragments may 
be used in polymerase chain reaction protocols in conjunction with a mutation tag sequence 
primer on DNAs prepared from a population of plants in which Mutator transposons or some 
other mutation-causing DNA element has been introduced (see Bensen, supra). The 
amplification of a specific DNA fragment with these primers indicates the insertion of the 
mutation tag element in or near the plant gene encoding the instant polypeptides. 
Alternatively, the instant nucleic acid fragment may be used as a hybridization probe against 
PCR amplification products generated from the mutation population using the mutation tag 
sequence primer in conjunction with an arbitrary genomic site primer, such as that for a 
restriction enzyme site-anchored synthetic adaptor. With either method, a plant containing a 
mutation in the endogenous gene encoding the instant polypeptides can be identified and 
obtained. This mutant plant can then be used to determine or confirm the natural function of 
the instant polypeptides disclosed herein. 

EXAMPLES 

The present invention is further defined in the following Examples, in which all parts 
and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be 
understood that these Examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only. From the above discussion and these Examples, one 
skilled in the art can ascertain the essential characteristics of this invention, and without 

14 



WO 00/04166 



PCMJS99/15871 



departing from the spirit and scope thereof, can make various changes and modifications of 
the invention to adapt it to various usages and conditions. 

EXAMPLE 1 

Composition of cDNA Libraries: Isol ation and Sequencing of cDNA Clones 
5 cDNA libraries representing mRNAs from various barley, corn, rice, soybean and 

wheat tissues were prepared. The characteristics of the libraries are described below. 

TABLE 2 

cDNA Libraries from Barley, Com, Rice, Soybean and Wheat 



Library 



Tissue 



bshl 
ccoln 

cdt2c 

crln 
csclc 

p0031 

pOUO 

P 0097 
p0122 
rlr24 

sdp2c 
ses8w 

ssl 
wll 

wlln 

wrl 



Clone 



Barley (Hordeum vulgare) sheath, developing seedling 

Com (Zea mays) cob of 67 day old plants grown in green 
house* 

Com (Zea mays) developing tassel 2 

Com (Zea mays) root from 7 day seedlings grown in light* 

Com (Zea mays) 20 day seedling (germination under cold 
stress) 

Corn (Zea mays) shoot culture, initiated from seed derived 
meristems culture was maintained on 273N medium. 
Corn (Zea mays) stages V3/V4** leaf tissue minus midrib 
harvested 4 hours, 24 hours and 7 days after infiltration 
with salicylic acid, tissues pooled* 

Corn (Zea mays) stage V9** whorl section (7 cm) from 
plant infected four times with european com borer 
Com (Zea mays) pith tissue collected from internode 
subtending ear node 5 days after pollenation 
Rice (Oryza sativa) leaf (15 days after germination) 
24 hours after infection of Magaporthe grisea strain 
4360-R-62 (AVR2-YAMO); Resistant 

Soybean (Glycine max) developing pods 6-7 mm 

Soybean (Glycine max) mature embryo 8 weeks after 
subculture 

Soybean (Glycine max) seedling 5-10 day 

Wheat (Triticum aestivum) leaf 7 day old seedling, light 
grown 

Wheat (Triticum aestivum) leaf 7 day old seedling, light 
grown* 

Wheat (Triticum aestivum) root; 7 day old seedling, light 
grown 



bshl.pk0002.f6 
ccoln.pk0005.g3 

cdt2c.pk002.gl 
cdt2c.pk002.116 
crln.pk0135.el0 
csclc.pk002.il 

p0031.ccmar05rb 

p0110.cgsma57r 

p0097.cqradl7rc 

p0122.ckamh70rc 

rlr24.pk0073.gl 

sdp2c.pk005.o22 
ses8w.pk0028.G 

ssl.pk0036.cl0 
wll.pk0009.c9 

wlln.pk0044.bl 

wrl.pk0160.dll 
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Libraf y Tissue . Clone 

wreln Wheat {Triticum aestivum) root: 7 day old etiolated wreln.pk0043.f9 
seedling* 

wreln.pk0043.h8 

__ _ wreln.pk0131.gl0 

♦These libraries were normalized essentially as described in U.S. Patent No. 5,482,845, 
incorporated herein by reference. 

**V3, V4 and V9 refer to stages of corn growth. The descriptions can be found in "How 
5 a Corn Plant Develops" Special Report No. 48, Iowa State University of Science and 
Technology Cooperative Extension Service Amesjowa, Reprinted February 1993. 

cDNA libraries may be prepared by any one of many methods available. For 
example, the cDNAs may be introduced into plasmid vectors by first preparing the cDNA 

10 libraries in Uni-ZAP™ XR vectors according to the manufacturer's protocol (Stratagene 
Cloning Systems, La Jolla, CA). The Uni-ZAP™ XR libraries are converted into plasmid 
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts 
will be contained in the plasmid vector pBluescript. In addition, the cDNAs may be 
introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA 

15 ligase (New England Biolabs), followed by transfection into DH10B cells according to the 
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in plasmid 
vectors, plasmid DNAs are prepared from randomly picked bacterial colonies containing 
recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via 
polymerase chain reaction using primers specific for vector sequences flanking the inserted 

20 cDNA sequences. Amplified insert DNAs or plasmid DNAs are sequenced in dye-primer 
sequencing reactions to generate partial cDNA sequences (expressed sequence tags or 
"ESTs !? ; see Adams et al., (1991) Science 252:1651). The resulting ESTs are analyzed using 
a Perkin Elmer Model 377 fluorescent sequencer. 

EXAMPLE 2 

25 Identification of cDNA Clone* 

cDNA clones encoding cellulose synthase enzymes were identified by conducting 
BLAST (Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol 
275:403-410; see also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS 

30 translations, sequences derived from the 3 -dimensional structure Brookhaven Protein Data 
Bank, the last major release of the S WISS-PROT protein sequence database, EMBL, and 
DDBJ databases). The cDNA sequences obtained in Example 1 were analyzed for similarity 
to all publicly available DNA sequences contained in the "nr" database using the BLASTN 
algorithm provided by the National Center for Biotechnology Information (NCBI). The 
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DNA sequences were translated in all reading frames and compared for similarity to all 
publicly available protein sequences contained in the "nr" database using the BLASTX 
algorithm (Gish and States (1993) Nature Genetics J:266-272) provided by the NCBI. For 
convenience, the P-value (probability) of observing a match of a cDNA sequence to a 
sequence contained in the searched databases merely by chance as calculated by BLAST are 
reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 

EXAMPLE 3 

Characterization of cDNA Clone?; Encoding Cellulose Sy nthase 
The BLASTX search using the EST sequences from clones listed in Table 3 revealed 
similarity of the polypeptides encoded by the cDNAs to cellulose synthase from Arabidopsis 
thaliana (NCBI Identifier No. gi 2827139, gi 2827141, gi 4467125, gi 4886756 and gi 
313561 1) and Gossypium hirsutum (NCBI Identifier No. gi 1706958 and 5081779). Shown 
in Table 3 are the BLAST results for individual ESTs ("EST"), the sequences of the entire 
cDNA inserts comprising the indicated cDNA clones ("FIS"), complete gene sequences 
("CGS") or contigs assembled from two or more ESTs ("Contig"): 

TABLE 3 



BLAST Results for Sequences Encoding Polypeptides Homologous 
to Arabidops is thaliana and Gossypium hirsutum Cellulose Synthase 



Clone 


Status 


BLAST pLog Score 


bshl.pk0002.f6 


FIS 


1 54.00 (gi 2827139) 


Contig composed of: 


Contig 


>254.00(gi 2827141) 


ccoln.pk0005.g3 




cdt2c.pk002.gl 






cdt2c.pk002.116 






csclc.pk002.il 






p0031.ccmar05rb 






p0110.cgsma57r 






crln.pk0135.el0 


FIS 


1 76.00 (gi 1706958) 


p0097.cqradl7rc 


CGS 


>254.00 (gi 2827141) 


p0122.ckamh70rc 


CGS 


>254.00 (gi 2827141) 


rlr24.pk0073.gl 


EST 


77.70 (gi 4467125) 


sdp2c.pk005.o22 


FIS 


>254.00 (gi 4886756) 


ses8w.pk0028.O 


EST 


>254.00(gi 2827139) 


ssl.pk0036.cl0 


EST 


>254.00(gi 2827141) 


Contig composed of: 


Contig 


>254.00(gi 5081779) 


wll.pk0009.c9 




wrl.pk0160.dll 






wreln.pk0043.f9 
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wreln.pk0043.h8 " 

wreln.pk0131.gl0 

wlln.pk0044.bl EST 166.00 (gi 3135611) 

Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID 
NOs:2, 4, 8, 10, 12, 14, 16, 1 8, 20 and 22 and the Arabidopsis thaliana (SEQ ID NOs:23 
(gi 2827139), 24 (gi 2827141), 26 (gi 4467125), 27 (gi 4886756) and 29 (gi 313561 1) ) and 
Gossypium hirsutum (SEQ ID NOs:25 (gi 1706958) and 28 (gi 5081779)) sequences. The 
data in Table 4 represents a calculation of the percent identity of the amino acid sequences 
set forth in SEQ ID NOs:2, 4, 8, 10, 12, 14, 16, 18, 20 and 22 and the Arabidopsis thaliana 
(SEQ ID NOs:23, 24, 26, 27 and 29) and Gossypium hirsutum (SEQ ID NOs:25 and 28) 
sequences. 

TABLE 4 

Percent Identity of Amino Acid Sequences Deduced From the Nucleotide Sequences 
of cDNA Clones Encoding Polypeptides Homologous 



SEQ ID NO. 


Percent Identity to 


2 


82% (gi 2827139) 


4 


69% (gi 2827141) 


6 


89% (gi 1706958) 


8 


70% (gi 2827141) 


10 


70% (gi 2827141) 


12 


36% (gi 4467125) 


14 


86% (gi 4886756) 


16 


88% (gi 2827139) 


18 


86% (gi 2827141) 


20 


87% (gi 5081779) 


22 


70% (gi 3135611) 



Sequence alignments and percent identity calculations were performed using the 
Megalign program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., 
Madison, WI). Multiple alignment of the sequences was performed using the Clustal 
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 
parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for 
pairwise alignments using the Clustal method were KTUPLE 1 , GAP PENALTY=3, 
WTNDOW=5 and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and 
probabilities indicate that the nucleic acid fragments comprising the instant cDNA clones 
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encode a substantial portion of a cellulose synthase. These sequences represent the first 
barley, corn, rice, soybean and wheat sequences encoding cellulose synthase. 

EXAMPLE 4 
Expressio n of Chimeric Genes in Monocot Cells 
A chimeric gene comprising a cDNA encoding the instant polypeptides in sense 
orientation with respect to the maize 27 kD zein promoter that is located 5' to the cDNA 
fragment, and the 10 kD zein 3" end that is located 3' to the cDNA fragment, can be 
constructed. The cDNA fragment of this gene may be generated by polymerase chain 
reaction (PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites 
(Ncol or Smal) can be incorporated into the oligonucleotides to provide proper orientation 
of the DNA fragment when inserted into the digested vector pML103 as described below. 
Amplification is then performed in a standard PCR. The amplified DNA is then digested 
with restriction enzymes Ncol and Smal and fractionated on an agarose gel. The appropriate 
band can be isolated from the gel and combined with a 4.9 kb Ncol-Smal fragment of the 
ptasmid pML103. Plasmid pML103 has been deposited under the terms of the Budapest 
Treaty at ATCC (American Type Culture Collection, 10801 University Blvd., Manassas, 
VA 201 10-2209), and bears accession number ATCC 97366. The DNA segment from 
pML103 contains a 1.05 kb Sall-Ncol promoter fragment of the maize 27 kD zein gene and 
a 0.96 kb Smal-Sall fragment from the 3' end of the maize 10 kD zein gene in the vector 
pGem9Zf(+) (Promega). Vector and insert DNA can be ligated at 15°C overnight, 
essentially as described (Maniatis). The ligated DNA may then be used to transform E. coli 
XLl-Blue (Epicurian Coli XL-1 Blue™; Stratagene). Bacterial transformants can be 
screened by restriction enzyme digestion of plasmid DNA and limited nucleotide sequence 
analysis using the dideoxy chain termination method (Sequenase™ DNA Sequencing Kit; 
U.S. Biochemical). The resulting plasmid construct would comprise a chimeric gene 
encoding, in the 5' to 3' direction, the maize 27 kD zein promoter, a cDNA fragment 
encoding the instant polypeptides, and the 10 kD zein 3' region. 

The chimeric gene described above can then be introduced into corn cells by the 
following procedure. Immature corn embryos can be dissected from developing caryopses 
derived from crosses of the inbred corn lines H99 and LH132. The embryos are isolated 10 
to 1 1 days after pollination when they are 1 .0 to 1 .5 mm long. The embryos are then placed 
with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu et al. 
(1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27°C. Friable 
embryogenic callus consisting of undifferentiated masses of cells with somatic 
proembryoids and embryoids bome on suspensor structures proliferates from the scutellum 
of these immature embryos. The embryogenic callus isolated from the primary explant can 
be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks. 
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The plasmid, p35S/Ac (obtained from Dr. Peter Eckes, Hoechst Ag, Frankfurt, 
Germany) may be used in transformation experiments in order to provide for a selectable 
marker. This plasmid contains the Pat gene (see European Patent Publication 0 242 236) 
which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers 
5 resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat 
gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus 
(Odell et ah (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene 
from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. 

The particle bombardment method (Klein et al. (1987) Nature 327:70-73) may be used 

10 to transfer genes to the callus culture cells. According to this method, gold particles (1 \im 
in diameter) are coated with DNA using the following technique. Ten \xg of plasmid DNAs 
are added to 50 ^iL of a suspension of gold particles (60 mg per mL). Calcium chloride 
(50 ^iL of a 2.5 M solution) and spermidine free base (20 of a 1.0 M solution) are added 
to the particles. The suspension is vortexed during the addition of these solutions. After 

15 10 minutes, the tubes are briefly centrifuged (5 sec at 1 5,000 rpm) and the supernatant 
removed. The particles are resuspended in 200 \xL of absolute ethanol, centrifuged again 
and the supernatant removed. The ethanol rinse is performed again and the particles 
resuspended in a final volume of 30 of ethanol. An aliquot (5 nL) of the DNA-coated 
gold particles can be placed in the center of a Kapton™ flying disc (Bio-Rad Labs). The 

20 particles are then accelerated into the corn tissue with a Biolistic™ PDS- 1 000/He (Bio-Rad 
Instruments, Hercules CA), using a helium pressure of 1000 psi, a gap distance of 0.5 cm 
and a flying distance of 1 .0 cm. 

For bombardment, the embryogenic tissue is placed on filter paper over agarose- 
solidified N6 medium. The tissue is arranged as a thin lawn and covered a circular area of 

25 about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of 
the PDS- 1 000/He approximately 8 cm from the stopping screen. The air in the chamber is 
then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a 
helium shock wave using a rupture membrane that bursts when the He pressure in the shock 
tube reaches 1 000 psi. 

30 Seven days after bombardment the tissue can be transferred to N6 medium that 

contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to 
grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to 
fresh N6 medium containing gluphosinate. After 6 weeks, areas of about 1 cm in diameter 
of actively growing callus can be identified on some of the plates containing the glufosinate- 

35 supplemented medium. These calli may continue to grow when sub-cultured on the 
selective medium. 

Plants can be regenerated from the transgenic callus by first transferring clusters of 
tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the 
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tissue can be transferred to regeneration medium (Fromm et al. (1990) Bio/Technology 
5:833-839). 

EXAMPLE 5 
Expression of Chim eric Genes in Dicot Cells 
A seed-specific expression cassette composed of the promoter and transcription 
terminator from the gene encoding the p subunit of the seed storage protein phaseolin from 
the bean Phaseolus vulgaris (Doyle et al. (1986) J. Biol. Chem. 261 :9228-9238) can be used 
for expression of the instant polypeptides in transformed soybean. The phaseolin cassette 
includes about 500 nucleotides upstream (5') from the translation initiation codon and about 
1650 nucleotides downstream (3') from the translation stop codon of phaseolin. Between the 
5' and 3* regions are the unique restriction endonuclease sites Nco I (which includes the ATG 
translation initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by 
Hind III sites. 

The cDNA fragment of this gene may be generated by polymerase chain reaction 
(PCR) of the cDNA clone using appropriate oligonucleotide primers. Cloning sites can be 
incorporated into the oligonucleotides to provide proper orientation of the DNA fragment 
when inserted into the expression vector. Amplification is then performed as described 
above, and the isolated fragment is inserted into a pUC18 vector carrying the seed 
expression cassette. 

Soybean embroys may then be transformed with the expression vector comprising 
sequences encoding the instant polypeptides. To induce somatic embryos, cotyledons, 
3-5 mm in length dissected from surface sterilized, immature seeds of the soybean cultivar 
A2872, can be cultured in the light or dark at 26°C on an appropriate agar medium for 
6-10 weeks. Somatic embryos which produce secondary embryos are then excised and 
placed into a suitable liquid medium. After repeated selection for clusters of somatic 
embryos which multiplied as early, globular staged embryos, the suspensions are maintained 
as described below. 

Soybean embryogenic suspension cultures can maintained in 35 mL liquid media on a 
rotary shaker, 150 rpm, at 26°C with florescent lights on a 16:8 hour day/night schedule. 
Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the method of 
particle gun bombardment (Klein et al. (1987) Nature (London) 527:70, U.S. Patent 
No. 4,945,050). A DuPont Biolistic™ PDS1000/HE instrument (helium retrofit) can be used 
for these transformations. 

A selectable marker gene which can be used to facilitate soybean transformation is a 
chimeric gene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et al. 
(1985) Nature i/J:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 
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(from E. coli; Gritz et al.(1983) Gene 25:179-188) and the 3' region of the nopaline synthase 
gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The seed expression 
cassette comprising the phaseolin 5' region, the fragment encoding the instant polypeptides 
and the phaseolin 3' region can be isolated as a restriction fragment. This fragment can then 
5 be inserted into a unique restriction site of the vector carrying the marker gene. 

To 50 nL of a 60 mg/mL 1 urn gold particle suspension is added (in order): 5 uL 
DNA (1 ug/uL), 20 ul spermidine (0.1 M), and 50 uL CaCl 2 (2.5 M). The particle 
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the 
supernatant removed. The DNA-coated particles are then washed once in 400 uL 70% 
10 ethanol and resuspended in 40 jiL of anhydrous ethanol. The DNA/particle suspension can 
be sonicated three times for one second each. Five uL of the DNA-coated gold particles are 
then loaded on each macro carrier disk. 

Approximately 300-400 mg of a two-week-old suspension culture is placed in an 
empty 60x15 mm petri dish and the residual liquid removed from the tissue with a pipette. 
15 For each transformation experiment, approximately 5-10 plates of tissue are normally 

bombarded. Membrane rupture pressure is set at 1 100 psi and the chamber is evacuated to a 
vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the 
retaining screen and bombarded three times. Following bombardment, the tissue can be 
divided in half and placed back into liquid and cultured as described above. 
20 Five to seven days post bombardment, the liquid media may be exchanged with fresh 

media, and eleven to twelve days post bombardment with fresh media containing 50 mg/mL 
hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post 
bombardment, green, transformed tissue may be observed growing from untransformed, 
necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into 
25 individual flasks to generate new, clonally propagated, transformed embryogenic suspension 
cultures. Each new line may be treated as an independent transformation event. These 
suspensions can then be subcultured and maintained as clusters of immature embryos or 
regenerated into whole plants by maturation and germination of individual somatic embryos. 

EXAMPLE 6 

30 Expression of Chimeric Genes in Microbial Cells 

The cDNAs encoding the instant polypeptides can be inserted into the T7 K coli 
expression vector pBT430. This vector is a derivative of pET-3a (Rosenberg et al. (1987) 
Gene 56:125-135) which employs the bacteriophage T7 RNA polymerase/T7 promoter 
system. Plasmid pBT430 was constructed by first destroying the EcoR I and Hind III sites in 

35 pET-3a at their original positions. An oligonucleotide adaptor containing EcoR I and 
Hind III sites was inserted at the BamH I site of pET-3a. This created pET-3aM with 
additional unique cloning sites for insertion of genes into the expression vector. Then, the 
Nde I site at the position of translation initiation was converted to an Nco I site using 
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oligonucleotide-directed mutagenesis. The DNA sequence of pET-3aM in this region, 
5'-CATATGG, was converted to 5'-CCCATGG in pBT430. 

Plasmid DNA containing a cDNA may be appropriately digested to release a nucleic 
acid fragment encoding the protein. This fragment may then be purified on a 1% NuSieve 
GTG™ low melting agarose gel (FMC). Buffer and agarose contain 10 ug/ml ethidium 
bromide for visualization of the DNA fragment. The fragment can then be purified from the 
agarose gel by digestion with GELase™ (Epicentre Technologies) according to the 
manufacturer's instructions, ethanol precipitated, dried and resuspended in 20 uL of water. 
Appropriate oligonucleotide adapters may be ligated to the fragment using T4 DNA ligase 
(New England Biolabs, Beverly, MA). The fragment containing the ligated adapters can be 
purified from the excess adapters using low melting agarose as described above. The vector 
pBT430 is digested, dephosphorylated with alkaline phosphatase (NEB) and deproteinized 
with phenol/chloroform as described above. The prepared vector pBT430 and fragment can 
then be ligated at 16°C for 15 hours followed by transformation into DH5 electrocompetent 
cells (GIBCO BRL). Transformants can be selected on agar plates containing LB media and 
100 ug/mL ampicillin. Transformants containing the gene encoding the instant polypeptides 
are then screened for the correct orientation with respect to the T7 promoter by restriction 
enzyme analysis. 

For high level expression, a plasmid clone with the cDNA insert in the correct 
orientation relative to the T7 promoter can be transformed into K coli strain BL21(DE3) 
(Studier et al. (1986) J. Mol. Biol. 7^:113-130). Cultures are grown in LB medium 
containing ampicillin (100 mg/L) at 25°C. At an optical density at 600 nm of approximately 
I, IPTG (isopropylthio-p-galactoside, the inducer) can be added to a final concentration of 
0.4 mM and incubation can be continued for 3 h at 25°. Cells are then harvested by 
centrifugation and re-suspended in 50 uL of 50 mM Tris-HCl at pH 8.0 containing 0.1 mM 
DTT and 0.2 mM phenyl methylsulfonyl fluoride. A small amount of 1 mm glass beads can 
be added and the mixture sonicated 3 times for about 5 seconds each time with a microprobe 
sonicator. The mixture is centrifuged and the protein concentration of the supernatant 
determined. One ug of protein from the soluble fraction of the culture can be separated by 
SDS-polyacrylamide gel electrophoresis. Gels can be observed for protein bands migrating 
at the expected molecular weight. 

EXAMPT.F. 7 
Evaluating Comp ounds for Thpj r Ability to Inhibit 
the Activity of Cell ulose Synthase 
The polypeptides described herein may be produced using any number of methods 
known to those skilled in the art. Such methods include, but are not limited to, expression in 
bacteria as described in Example 6, or expression in eukaryotic cell culture, in planta, and 
using viral expression systems in suitably infected organisms or cell lines. The instant 
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polypeptides may be expressed either as mature forms of the proteins as observed in vivo or 
as fusion proteins by covalent attachment to a variety of enzymes, proteins or affinity tags. 
Common fusion protein partners include glutathione S-transferase ("GST"), thioredoxin 
("Trx"), maltose binding protein, and C- and/or N-terminal hexahistidine polypeptide 
5 ("(His) 6 "). The fusion proteins may be engineered with a protease recognition site at the 
fusion point so that fusion partners can be separated by protease digestion to yield intact 
mature enzyme. Examples of such proteases include thrombin, enterokinase and factor Xa. 
However, any protease can be used which specifically cleaves the peptide connecting the 
fusion protein and the enzyme. 
10 Purification of the instant polypeptides, if desired, may utilize any number of 

separation technologies familiar to those skilled in the art of protein purification. Examples 
of such methods include, but are not limited to, homogenization, filtration, centrifugation, 
heat denaturation, ammonium sulfate precipitation, desalting, pH precipitation, ion exchange 
chromatography, hydrophobic interaction chromatography and affinity chromatography, 

15 wherein the affinity ligand represents a substrate, substrate analog or inhibitor. When the 
instant polypeptides are expressed as fusion proteins, the purification protocol may include 
the use of an affinity resin which is specific for the fusion protein tag attached to the 
expressed enzyme or an affinity resin containing ligands which are specific for the enzyme. 
For example, the instant polypeptides may be expressed as a fusion protein coupled to the 

20 C-terminus of thioredoxin. In addition, a (His) 6 peptide may be engineered into the 

N-terminus of the fused thioredoxin moiety to afford additional opportunities for affinity 
purification. Other suitable affinity resins could be synthesized by linking the appropriate 
ligands to any suitable resin such as Sepharose-4B. In an alternate embodiment, a 
thioredoxin fusion protein may be eluted using dithiothreitol; however, elution may be 

25 accomplished using other reagents which interact to displace the thioredoxin from the resin. 
These reagents include p-mercaptoethanol or other reduced thiol. The eluted fusion protein 
may be subjected to further purification by traditional means as stated above, if desired. 
Proteolytic cleavage of the thioredoxin fusion protein and the enzyme may be accomplished 
after the fusion protein is purified or while the protein is still bound to the ThioBond™ 

30 affinity resin or other resin. 

Crude, partially purified or purified enzyme, either alone or as a fusion protein, may be 
utilized in assays for the evaluation of compounds for their ability to inhibit enzymatic 
activation of the instant polypeptides disclosed herein. Assays may be conducted under well 
known experimental conditions which permit optimal enzymatic activity. For example, 

35 assays for cellulose synthase activity are presented in WO 98/1 8949 and WO 98/00549. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment comprising at least 900 nucleotides, wherein 
the nucleic acid fragment encodes a cellulose synthase comprising a member selected from 
the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is at least 90% identical to the amino acid sequence set forth in a 
member selected from the group consisting of SEQ ID NO:2, 6, 12, 14, 
16, 18, 20 and 22; 

(b) an isolated nucleic acid fragment that is complementary to (a). 

2. The isolated nucleic acid fragment of Claim 1 wherein nucleic acid fragment is 
a functional RNA. 

3. The isolated nucleic acid fragment of Claim 1 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting ofSEQIDNO:l, 5, 11, 13, 15, 17, 19and21. 

4. A chimeric gene comprising the nucleic acid fragment of Claim 1 operably 
linked to suitable regulatory sequences. 

5. A transformed host cell comprising the chimeric gene of Claim 4. 

6. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:2, 6, 12, 14, 16, 18, 20 and 22. 

7. An isolated nucleic acid fragment encoding a cellulose synthase comprising a 
member selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding an amino acid sequence that 
is functionally acitve polypeptide and at least 80% identical to the 
amino acid sequence set forth in a member selected from the group 
consisting of SEQ ID NO:4, 8 and 10 ; 

(b) an isolated nucleic acid fragment that is complementary to (a). 

8. The isolated nucleic acid fragment of Claim 7 wherein nucleic acid fragment is 
a functional RNA. 

9. The isolated nucleic acid fragment of Claim 7 wherein the nucleotide sequence 
of the fragment comprises the sequence set forth in a member selected from the group 
consisting of SEQ ID NO:3, 7 and 9. 

1 0. A chimeric gene comprising the nucleic acid fragment of Claim 7 operably 
linked to suitable regulatory sequences. 

11. A transformed host cell comprising the chimeric gene of Claim 1 0. 
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1 2. A cellulose synthase polypeptide comprising all or a substantial portion of the 
amino acid sequence set forth in a member selected from the group consisting of SEQ ID 
NO:4,8, 10. 

13. A method of altering the level of expression of a cellulose synthase in a host 
cell comprising: 

(a) transforming a host cell with the chimeric gene of any of Claims 4 and 
10; and 

(b) growing the transformed host cell produced in step (a) under conditions 
that are suitable for expression of the chimeric gene 

wherein expression of the chimeric gene results in production of altered levels of a cellulose 
synthase in the transformed host cell. 

14. A method of obtaining a nucleic acid fragment encoding all or a substantial 
portion of the amino acid sequence encoding a cellulose synthase comprising: 

(a) probing a cDNA or genomic library with the nucleic acid fragment of 
any of Claims 1 and 7; 

(b) identifying a DNA clone that hybridizes with the nucleic acid fragment 
any of of Claims land 7; 

(c) isolating the DNA clone identified in step (b); and 

(d) sequencing the cDNA or genomic fragment that comprises the clone 
isolated in step (c) 

wherein the sequenced nucleic acid fragment encodes all or a substantial portion of the 
amino acid sequence encoding a cellulose synthase. 

15. A method of obtaining a nucleic acid fragment encoding a substantial portion 
of an amino acid sequence encoding a cellulose synthase comprising: 

(a) synthesizing an oligonucleotide primer corresponding to a portion of 
the sequence set forth in any of SEQ ID NOs:l, 3, 5, 7, 9, 1 1, 13, 15, 
17, 19 and 21; and 

(b) amplifying a cDNA insert present in a cloning vector using the 
oligonucleotide primer of step (a) and a primer representing sequences 
of the cloning vector 

wherein the amplified nucleic acid fragment encodes a substantial portion of an amino acid 
sequence encoding a cellulose synthase. 

16. The product of the method of Claim 14. 

1 7. The product of the method of Claim 1 5 . 

1 8. A method for evaluating at least one compound for its ability to inhibit the 
activity of a cellulose synthase, the method comprising the steps of: 
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(a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a cellulose synthase, operably linked to suitable 
regulatory sequences; 

(b) growing the transformed host cell under conditions that are suitable for 
expression of the chimeric gene wherein expression of the chimeric 
gene results in production of the cellulose synthase encoded by the 
operably linked nucleic acid fragment in the transformed host cell; 

(c) optionally purifying the cellulose synthase expressed by the 
transformed host cell; 

(d) treating the cellulose synthase with a compound to be tested; and 

(e) comparing the activity of the cellulose synthase that has been treated 
with a test compound to the activity of an untreated cellulose synthase, 

thereby selecting compounds with potential for inhibitory activity. 
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Figure 1 

1 60 
SEQ ID NO: 2 

SEQ ID NO: 4 RAAQAQRNKGKPQPEEQKLASVSLP — LPHSRFIPFPPRRRYRRR RTHACPG I 

SEQ ID NO: 6 

SEQ ID NO: 8 HSSYTKSRSSLAQPRAAPRQAQPPP — ATAACACERSPRPGDQRRGGLRAFRCAAAAGFV 

SEQ ID NO: 10 

SEQ ID NO: 12 RCS RRWTCSSPPPTPTRSRRSPRRTP 

SEQ ID NO: 14 

SEQ ID NO: 16 

SEQ ID NO: 18 

SEQ ID NO: 20 

SEQ ID NO: 22 

SEQ ID NO: 23 

SEQ ID NO: 24 MNTGGR ' 

SEQ ID NO: 25 

SEQ ID NO: 26 MASTPPQTSKKVRNNSGSGQTVKFARRTSSGRYVSLS-RDNIELSGELSGDYSNYTVHIP 

SEQ ID NO: 27 

SEQ ID NO: 28 Z~~~Z"I 

SEQ ID NO: 29 R PR * 



61 120 

SEQ ID NO: 2 

SEQ ID NO: 4 W-RSGSARG ME-ASAGLVAGSHNRNELV-VIRRDGEPGPKP— MDQRNGQVCQI — 

SEQ ID NO: 6 

SEQ ID NO: 8 RERDPAGRGGGPEME-ASAGLVAGSHNRNELV-VIRRDRESGAAGGGAARRAEAPCQI-- 

SEQ ID NO: 10 ME-ASAGLVAGSHNRNELV-VIRRDGDPGPKP — PREQNGQVCQI — 

SEQ ID NO: 12 _____ C _L__ 

SEQ ID NO: 14 ME-ASAGLVAGSHNRNELV-VIHGHEEP — KA — LKNLDGQVCEI — 

SEQ ID NO: 16 

SEQ ID NO: 18 "~ 

SEQ ID NO: 20 

SEQ ID NO: 22 

SEQ ID NO: 23 ME-ASAGLVAGSYRRNELV-RIRHESDGGTKP— LKNMNGQICQI— 

SEQ ID NO: 24 LIAGSHNRNEFV-LI — NADESARIRSVQELSGQTCQI — 

SEQ ID NO: 25 _______ 

SEQ ID NO: 26 PT PDNQ PMAT KAEEQYVSNS L FTGGFNS VTRAHLMDKVI DS DVT H PQMAGAKGSS CAMPA 

SEQ ID NO: 27 ME-ASAGLVAGSHNRNELV-VIHNHEEP--KP — LKNLDGQFCEI-- 

SEQ ID NO: 28 

SEQ ID NO: 29 LIAGSHNRNEFV-LI— NADENARIRS VQELSGQTCQI — 



SEQ ID NO: 2 



121 180 



SEQ ID NO: 4 CGDDVGRNPDGEPFVACNECAFPICRDCYEYERREGTQNCPQCKTRFKRLKGCARVPGD- 

SEQ ID NO: 6 

SEQ ID NO: 8 CGDEVGVGFDGE P FVACNECAFPVCRACYEYERREGSQAC PQCRTRYKRLKGCPRVAGD- 

SEQ ID NO: 10 CGDDVGLAPGGDPFVACNECAFPVCRDCYEYERREGTQNCPQCKTRYKRLKGCQRVTGD- 

SEQ ID NO: 12 CPY 

SEQ ID NO: 14 CGDGVGLTVDGDLFVACNECGFPVCRPCYEYERREGSHLCPQCKTRYKRLKGSPRVEGDD 

SEQ ID NO: 16 

SEQ ID NO: 18 Z1ZZZ 

SEQ ID NO: 20 ~~IZI 

SEQ ID NO:22 "II 

SEQ ID NO: 23 CGDDVGLAETGDVFVACNECAFPVCRPCYEYERKDGTQCCPQCKTRFRRHRGSPRVEGDE 

SEQ ID NO: 24 CGDEIELTVSSELFVACNECAFPVCRPCYEYERREGNQACPQCKTRYKRIKGSPRVDGDD 

SEQ ID NO: 25 

SEQ ID NO: 26 CDGNVMKDERGKDVMPC-ECRFKICRDCFMDAQKE-TGLCPGCKEQYK II__ 
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Figure 1 (cont'd.) 

SEQ ID NO: 27 CGDQIGLTVEGDLFVACNECGFPACRPCYEYERREGTQNCPQCKTRYKRLRGSPRVEGDE 

SEQ ID NO: 28 

SEQ ID NO: 29 CRDEIELTVDGEPFVACNECAFPVCRPCYEYERREGNQACPQCKTRFKRLKGSPRVEGD- 

181 240 

SEQ ID NO: 2 

SEQ ID NO: 4 EEEDGVDDLENEFNWSDK HDSQYLAESMLHAHMSYG-RGADLDGVPQPFHPIPNVP 

SEQ ID NO: 6 

SEQ ID NO: 8 EEEDGVDDLEGEFGLQDGAAHEDDPQYVAESMLRAQMSYG-RGGDAH PGFSPVPNVP 

SEQ ID NO: 10 EEEDGVDDLDNEFNW-DG HDSQSVAESMLYGHMSYG-RGGDPNGAPQAFQLNPNVP 

SEQ ID NO: 12 

SEQ ID NO: 14 DEEDV-DDIEHEFNIDEQKNKHGQ VAEAMLHGRMSYG — RGPEDDDNSQFPTPVIAG 

SEQ ID NO: 16 



SEQ ID NO: 18 
SEQ ID NO: 20 
SEQ ID NO: 22 



SEQ ID NO: 23 DEDDV-DDIENEFNYAQGANKARH QRHGE EFSSS — SRHESQPIPLLTHGHTVS 

SEQ ID NO: 24 EEEEDI DDLEYEFD HGMDPEHAAEAALSSRLNTG — RGGLDSAPPG SOIP 

SEQ ID NO: 25 _____ 

SEQ ID NO: 26 IGDLDDD TPDYSSGALPLPAPG------- 

SEQ ID NO: 27 DEEDI-DDIEYEFNIEHEQDKHKH SAEAMLYGKMSYG — RGPEDDENGRFP-PVIAG 

SEQ ID NO: 28 

SEQ ID NO: 29 EEEDDIDDLDNEFEYGN NGIGFDQVSEGMSISRRNSGFPQSDLDSAPPG -SQIP 

241 300 

SEQ ID NO: 2 __ 

SEQ ID NO: 4 LLTNGQMVDDI PPDQHALVPS FV GGGGKRIHPLPYADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 6 _ 

SEQ ID NO: 8 LLTNGQMVDDI PPEQHALVPSYMSGGGGGGKRIHPLPFADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 10 LLTNGQMVDDI PPEQHAL VPS FM GGGGKRIHPLPYADPSLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 12 

SEQ ID NO: 14 GRSR PVSGEFPISSNAYGDQMLSSSLHKRVHPYPVSEPGSARW DEKKXDG 

SEQ ID NO: 16 

SEQ ID NO: 18 ~ " 

SEQ ID NO: 20 

SEQ ID NO: 22 LLTNGQMVDDI PPEQHALVPSYMSGGGGGGKRIHPLPFADPNLPVQPRSMDPSKDLAAYG 

SEQ ID NO: 23 GEIRTPDTQSVRTTSGPLGPSDRNAISSPYIDPR-QPVPVRIVDPSK DLNSYG 

SEQ ID NO: 24 LLTYCDEDADMYSDRHALIVP — PS-TGYGNRVYPAPFTDSSAPPQARSMVPQKDIAEYG 

SEQ ID NO: 25 

SEQ ID NO: 26 KDQRGNNNNMSMMKRNQNGEFDHNRWLF ETQGTYG 

SEQ ID NO: 27 GHS GEFPVGG-GYGNG — EHGLHKRVHPYPSSEAGS EGG 

SEQ ID NO: 28 

SEQ ID NO: 29 LLTYGDEDVEISSDRHALIVP — PSLGGHGNRVHPVSLSDPTVAAHRRLMVPQKDLAVYG 

301 360 

SEQ ID NO: 2 

SEQ ID NO: 4 YGSVAWKERMESWKQKQ-ERMHQTRNDGGGD DG DDADL PLM- DEARQPLSR 

SEQ ID NO: 6 

SEQ ID NO: 8 YGSVAWKERMEGWKQKQ-ERLQHVRSEGGGDW DG DDADL PLM- DEARQPLSR 

SEQ ID NO: 10 YGSVAWKERMENWKQRQ-ERMHQTGNDGGGD DGDDADLPLM-DEARQQLSR 

SEQ ID NO: 12 

SEQ ID NO: 14 WKDRMDDWKLQQG NLGPEPDEDPDAAML- DEARQPLSR 

SEQ ID NO: 16 

SEQ ID NO: 18 HE 

SEQ ID NO: 20 

SEQ ID NO: 22 YGSVAWKERMEGWKQKQ-ERLQHVRSEGGGDW DG DDADL PLM- DEARQPLSR 

SEQ ID NO: 23 LGNVDWKERVEGWKLKQEKNMLQMTGKYHEGKGG-EIEGTGSNGEELQM-ADDTRLPMSR 
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Figure 1 (cont'd.) 

SEQ ID NO: 24 YGSVAWKDRMEVWKRRQGEKLQVIKHEGGNNGRGSN-DDDELDDPDMPMM-DEGROPLSR 
SEQ ID NO: 25 V 

SEQ ID NO: 26 YGNAYWP QDEMYGD DMDEGMRGGMVETADKPWRPLSR 

SEQ ID NO- 28 --WRERMDDWKLQHG NLGPEPDDDPEMGLI-DEARQPLSR 

SEQ ID NO: 29 YGSVAWKDRMEEWKRKQNEKLQWRHEGDP DFEDGDDADFPI^-DEGRQPLSM 

361 420 



KIPLPSSQINPYRMIIIIRLWLCFFFHYRVMHPVPDAFALWLISVICEIWFAMSWILDQ 



SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 

SEQ ID NO: 8 KVPISSSRINPYRMIIVIRLVVLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSWILDQ 
SEQ ID NO: 10 KIPLPSSQINPYRMIIIIRLWLGFFFHYRVMHPVNDAFALWLISVICEIWFAMSWILDQ 



KVPIASSKINPYRMVIVARLVILAFFLRYRLMNPVHDALGLWLTSIICEIWFAFSWILDQ 



SEQ ID NO: 12 
SEQ ID NO: 14 
SEQ ID NO: 16 

SEQ ID NO: 18 LHPVNDAYGLWLTSVICEIWFAVSWIMDQ 

SEQ ID NO: 20 

SEQ ID NO: 22 KVPISSSRINPYRMIIVIRLVVLGFFFHYRVMHPAKDAFALWLISVICEIWFAMSCILDQ 

SEQ ID NO:23 VVPIPSSRLTPYRWIILRLIILCFFLQYRTTHPVKNAYPLWLTSVICEIWFAFSWLLDQ 

SEQ ID NO ^5 K ^ PIRSSRINPYRMLILCRLAILa ^ 

SEQ ID NO:26 RI PI PAAI I S PYRLLI VI RFWLCFFLTWRIRNPNEDAIWLWLMSI ICELWFGFSWILDQ 

SEQ ID NO:27 KVPIASSKINPYRMVIVARLVILAVFLRYRLLNPVHDALGLWLTSVICEIWFAVSWILDQ 

SEQ ID NO: 28 

SEQ ID NO: 29 KIPIKSSKINPYRMLIVLRLVILGLFFHYRILHPVKDAYALWLISVICEIWFAVSWLDQ 



SEQ ID NO: 2 



421 480 



SEQ ID NO: 4 FPKWFPIERETYLDRLSLRFDKEGHPS QLAPVDFFVSTVDPLKEPPLVTANTVLS 

SEQ ID NO: 6 

SEQ ID NO: 8 FPKWLPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPTKEPPLVTANTVLS 

SEQ ID NO: 10 FPKWFPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPLKEPPLVTTNTVLS 

SEQ ID NO: 12 

SEQ ID NO: 14 FPKWFPIDRETYLDRLSIRYEREGEPN MLAPVDVFVSTVDPMKEPPLVTANTVLS 

SEQ ID NO: 16 

SEQ ID NO: 18 FPKWYPIQRETYLDRLSLRYEKEGKPS ELSSVDVFVSTVDPMKEPPLITANTVLS 

SEQ ID NO:20 

SEQ ID NO: 22 FPKWFPIERETYLDRLSLRFDKEGQPS QLAPI DFFVSTVDPTKEPPLVTANTVLS 

SEQ ID NO: 23 FPKWYPINRETYLDRLAIRYDRDGEPS QLVPVDVFVSTVDPLKEPPLVTANTVLS 

SEQ ID NO: 24 FPKWYPIERETYLDRLSLRYEKEGKPS GLAPVDVFVSTVDPLKEPPLITANTVLS 

SEQ ID NO: 25 

SEQ ID NO:26 IPKLCPINRSTDLEVLRDKFDMPSPSNPTGRSDLPGIDLFVSTADPEKEPPLVTANTILS 

SEQ ID NO: 27 FPKWFPIERETYLDRLSLRYEREGEPN MLAPVDVFVSTVDPLKEPPLVTSNTVLS 

SEQ ID NO: 28 

SEQ ID NO:29 FPKWYPIERETYLDRLSLRYEKEGKPS GLSPVDVFVSTVDPLKEPPLITANTVLS 



SEQ ID NO: 2 



481 540 



SEQ ID NO: 4 ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYSLEPRAPEWYFQQ — 

SEQ ID NO: 6 H 

SEQ ID NO: 8 ILSVDYPVEKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFSKKFNIEPRAPEWYFQQ— 

SEQ ID NO: 10 ILSVDYPVDKVSCYVSDDGAAMLTFEALSETSEFAKKWVPFCKRYNIEPRAPEWYFQQ— 

SEQ ID NO: 12 I LAAGYPAGKVTC YI S DDAGAEVTRNAVVEAARFAALWVS FCRKHGVE PRNLEAYFNAGE 

SEQ ID NO: 14 ILAMDYPVDKISCYISDDGASMCTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFSE — 

SEQ ID NO: 16 

SEQ ID NO: 18 ILAVDYPVDKVACYVSDDGAAMLTFEALSETSEFARRWVPFCKKYNIEPRAPEWYFGQ — 
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Figure 1 (cont'd.) 

SEQ ID NO: 20 

SEQ ID NO: 22 I LSVDY PVEKVSC YVSDDGAAMLT FEALSETSE FAKKWVPFSKK^ IEP^PEWYFQQ~ 

SEQ ID NO: 23 ILSVDYPVDKVACYVSDDGSAMLTFESLSETAEFAKKWVPFCKKFNIEPRAPEFYFAQ— 

SEQ ID NO: 24 I LAVDYPVDKVAC YVSDDGAAMLT FEALSDTAEFARKWV PFCKKFN IE PRAPEWYFSQ-- 
™ RRWVPFCKKHNVEPRAPEFYFNE— 

SEQ ID NO: 26 ILAVDYPVEKVSCYLSDDGGALLSFEAMAEAASFADLWVPFCRKHNIEPRNPDSYFSL— 

SEQ ID NO: 27 ILAMDYPVEKISCYVSDDGASMLTFESLSETAEFARKWVPFCKKFSIEPRAPEMYFTL- 

SEQ ID NO: 28 DYPVEKVSCYVSDDGAAMLTFEALSETSEFARKWVPFCKKYNIEPRAPEWYFAQ — 

^ 10 NO:29 I LAVDYPVDKVAC YVSDDGAAMLTFEALSETAEFARKWVPFCKKYCIEPRAPEWYFCH — 



SEQ ID NO: 2 
SEQ ID NO: 4 
SEQ ID NO: 6 
SEQ ID NO: 8 
SEQ ID NO: 10 



541 600 



-KIDYLKDKVAPNFVRERRAMKREYEEFKVRINALVAKAQ- 



KIDYLKDKVAASFVRERRAMKREYEEFKVRINALVAKAQ 

KIDYLKDKVAANFVRERRAMKREYEEFKVRINALVAKAQ 

SEQ ID NO: 12 GGGGKAKVVARGSY-RGMAWPELVRDRRRVRREYEEMRLRIDALQAADARRR- 

SEQ ID NO: 14 KIDYLKDKVQPTFVKERRAMKREYEEFKVRINALVAKAQ 

AKAQ 

KMDYLKNKVHPAFVRERRAMKRDYEEFKVRINSLVATAQ 



SEQ ID NO: 16 

SEQ ID NO: 18 

SEQ ID NO: 20 

SEQ ID NO: 22 KIDYLKDKVAASFVRERRAMKREYEEFKVRINALVAKAQ 

KIDYLKDKIQPSFVKERRAMKREYEEFKVRINALVAKAQ 

KMDYLKNKVH PAFVRERRAMKRD YEEFKVKINALVATAQ 

KIDYLKDKVH PS FVKERRAMKRE YEEFKVRINALVAKAQ- 

™ ™ l2 ^ KI DPTKNKSRI DFVKDRRKI KRE YDEFKVRI NGLPDS I RRRS DAFNAREE 

SEQ ID NO: 27 KVDYLQDKVHPT FVKERRAMKRE YEEFKVRINAQVAKAS 

KI DYLKDKVQTS FVKERRAMKRE YEEFKVRVNGLVAKAQ 

KMDYLKNKVH PAFVRERRAMKRD YEEFKVKINALVATAQ 



SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 



SEQ ID NO:28 
SEQ ID NO: 29 



601 



SEQ ID NO: 2 



660 



™ ^''i KVPEEGWTMQDGTPWPG NNVRDHPGMIQVFL — 

5LQ ID NO : 6 — — — 



SEQ ID NO: 8 KVPEEGWTMQDGS PWPG 

SEQ ID NO: 10 KVPEEGWTMQDGTPWPG 

SEQ ID NO: 12 

SEQ ID NO: 14 KVPQGGWIMQDGTPWPG— 

SEQ ID NO: 16 KMPEEGWTMQDGTPWPG— 

SEQ ID NO: 18 KVPEDGWTMQDGTPWPG 

SEQ ID NO: 20 

SEQ ID NO: 22 KVPEEGWTMQDGS PWPG 

SEQ ID NO: 23 KI PEEGWTMQDGTPWPG 

SEQ ID NO: 24 KVPEEGWTMQDGTPWPG-— 

SEQ ID NO:25 KKPEEGWVMQDGTPWPG 

SEQ ID NO: 26 MKALKQMRESGGDPTEPVKVPKATW-MADGTHWPGTWAAS 

™ ^11 KVPLEGWIMQDGTPWPG NNTKDHPGMIQVFL 

rr! ^11 KVPEEGWIMQDGTPWPG NNTRDHPGMIQVFL— 

SEQ ID NO: 29 KVPEDGWTMQDGTPWPG NSVRDHPGMIQVFL 



--NNVRDHPGMIQVFL 

--NNVRDHPGMIQVFL 

RRGAADDHAGVVQVLI DFA 

— NNTKDHPGMIQVFL 

— NNPRDHPGMIQVFL 

—NNVRDHPGMIQVFL 



NNTRDHPGMIQVFL 

NNVRDHPGMIQVFL 

NNTRDHPGMIQVYL 



SEQ ID NO: 2 



661 720 



SEQ ID NO: 4 G-QSGGHDVE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNA 

SEQ ID NO : 6 _ _ 

lln ™ N0:8 G-QSGGRDVE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLSNA 

™ v,2 : ^ G-QSGGLDCE GNELPRLVYVSREKRPGYNHHKKAGAMNALVRVSAVLTNA 

SEQ ID NO: 12 GSVPQLGVANGSKLIDVASVDVCLPALVYVCREKRRGHAHHRKAGAMNA 
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Figure 1 (cont'd.) 

SEQ ID NO: 14 G-SSGGLDTE G NQL P RL V Y VSREKRPG FQH H KKAG AMN AL VRVS A VLT NA 

SEQ ID NO: 16 G-HSGGLDTD GNEL PRLV YVS REKRPG FQHHKKAGAMNAL I RVS AVLTNG 

SEQ ID NO: 18 G-QDGVRDVE GNELPRLV YVS REKRPG FDHHKKAGAMNALVRASAI I TNA 

SEQ ID NO: 20 

SEQ ID NO: 22 ZZZZZZZJlll 

SEQ ID NO: 23 G-HSGGLDTD GNELPRLI YVSREKRPG FQHHKKAGAMNALI RVSAVLTNG 

SEQ ID NO: 24 G-HSGVRDTD GNELPRLVYVS REKRPG FDHHKKAGAMNSLI RVSAVLSNA 

SEQ ID NO: 25 G-SAGALDVD GKELPRLVYVSREKRPGYQHHKKAGAENALVRVSAVLTNA 

SEQ ID NO: 26 SSDPLIG-NSDDKVIDFSDTDTRLPMFVYVS REKRPG YDHNKKAGAMNALVRASAILSNG 

SEQ ID NO: 27 G-HSGGFDVE GHELPRLVYVSREKRPGFQHHKKAGAMNALVRVAGVLTNA 

SEQ ID NO: 28 G-QSGGLDAE GNELPRLVYVS REKRPG FQHH KKAGAMN ALVRVSAVLTNG 

SEQ ID NO: 29 G-S DGVRDVE NNELPRLV YVS REKRPG FDHHKKAGAMNSLI RVSGVLSNA 

721 780 

SEQ ID NO: 2 °_ 

SEQ ID NO: 4 PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VC YVQ F PQR FDG I DRH DR Y AN 

SEQ ID NO: 6 

SEQ ID NO: 8 AYLLNLDCDHYINNSKAIKEAMCFMMDPLVGKK VCYVQFPQRFDGI DKNDRYAN 

SEQ ID NO: 10 PYLLNLDCDHYINNSKAIKEAMCFMMDPLLGKK VCYVQFPQRFDGI DRHDRYAN 

SEQ ID NO: 12 PFILDLDCDYYVNNSQALRAGICFMIERGGGGAAEDAGAVAFVQFPQRVDGVDPGDRYAN 

SEQ ID NO: 14 PFMLNLDCDHYVNNSKAAREAMCFLMDPQTGKK VC YVQFPQRFDG I DTH DRYAN 

SEQ ID NO: 16 AYLLNV DCDH Y FNNS KALKEAMC FMMD P VLGKK TCYVQFPQRFDGIDLHDRYAN 

SEQ ID NO: 18 PYLLNVDCDHYINNSKALREAMCFMMDPQLGKK VC YVQFPQRFDG I DRHDRYSN 

SEQ ID NO: 20 EAMC FLMDPNLGPQ VCYVQFPQRFDGI DRNDRYAN 

SEQ ID NO: 22 

SEQ ID NO: 23 AYLLNVDCDHYFNNSKAIKEAMCFMMDPAIGKK CCYVQFPQRFDGIDLHDRYAN 

SEQ ID NO: 24 PYLLNVDCDHYINNSKAIRESMCFMMDPQSGKK VCYVQFPQRFDGI DRHDRYSN 

SEQ ID NO: 25 PFILNLDCDHYINNSKAMREAMCFLMDPQFGKK LC YVQFPQRFDG I DRHDRYAN 

SEQ ID NO: 26 P FI LNLDC DH Y I YNCKAVREGMC FMMDRG- GE D ICYIQFPQRFEGI DPSDRYAN 

SEQ ID NO: 27 P FMLNLDC DH YVNNS KAVREAMC FLMDPQ I G KK VCYVQFPQRFDGI DTNDRYAN 

SEQ ID NO: 28 AFLLNLDCDHYINNSKALREAMCFLMDPNLGKQ VCYVQFPQRFDGI DRNDRYAN 

SEQ ID NO: 29 PYLLNVDCDHYINNSKALREAMCFMMDPQSGKK ICYVQFPQRFDGIDRHDRYSN 



SEQ ID NO: 2 



781 840 



SEQ ID NO: 4 RNVVFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCI 

SEQ ID NO: 6 

SEQ ID NO: 8 RNVVFFDINMKGLDGIQGPIYVGTGCVFRRQALYGYDAP KTKKPPSRTCNCWPKWCL 

SEQ ID NO: 10 RN VVFFD I NMKGLDG IQG P I Y VGTGC VFRRQAL YG Y DAP KT KKPPS RTCNC WPKWC F 

SEQ ID NO: 12 HNRVLFDCTELGLDGLQGPIYVGTGCLFRRVALYSVDLPR 

SEQ ID NO: 14 RNTVFFDINMKGLDGIQGPVYVGTGCVFRRQALYGYNPPKGPK — RPKMVSCDC 

SEQ ID NO: 16 RNIVFFDINMKGQDGVQGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 

SEQ ID NO: 18 RNVVFFDINMKGLDGIQGPIYVGTGCVFRRYALYGYDAP AKKKPPSKTCNCWPKWCC 

SEQ ID NO:20 RNTVFFDINLRGLDGIQGPVYVGTGCVFNRTAIYGYEPPIKAK--K PGFLA 

SEQ ID NO: 22 1„„ 

SEQ ID NO: 23 RNIVFFDI NMKGLDG I QGPVYVGTGCCFNRQALYGYDPVLTEEDLE PNIIV 

SEQ ID NO:24 RNWF FD I NMKGLDG I QGP I YVGTGC VFRRQAL YG FDA P KKKKPPGKTCNCWPKWCC 

SEQ ID NO: 25 RNWFFDINMLGLDGLQGPVYVGTGCVFNRQALYGYDPPVSEKRPK MTCDCWPSWCC 

SEQ ID NO: 26 NNTVFFDGNMRALDGVQGPVYVGTGTMFRRFALYGFDPP 

SEQ ID NO: 27 RNTVFFDINMKGLDGIQGPVYVGTGCVFKRQALYGYEPPKGPK — RPKMISCGC 

SEQ ID NO: 28 RNTVFFDINLRGLDGIQGPVYVGTGCVFNRTALYGYEPPLKPKHRK TGILS 

SEQ ID NO:29 RNWFFDINMKGLDGLQGPIYVGTGCVFRRQALYGFDAP KKKKG P RKTCNCW P KWC L 
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Figure 1 (cont'd.) 



SEQ ID NO: 2 

SEQ ID NO: 4 CCCCFGNRKTKKKTKTSKP KFEKIKKLF-KKKENQAPAYALGEIDEA—APG- 



SEQ ID NO: 8 SCCC SRNKNKKKTTKP 

SEQ ID NO: 10 CCCC FGNRKQKK TTKP 

SEQ ID NO: 12 

SEQ ID NO: 14 -CPCFGSRKKYKE 

SEQ ID NO: 16 -KSCCGSRKKGKGGNKK 

SEQ ID NO: 18 LCC — GSRKKKN ANS 

SEQ ID NO: 20 -SLCXG-KKKASKSKKR 

SEO ID NO: 22 



KTEKKKRLFFKKAENPS PAYALGEI DEG — APG- 
KTEKKKLLFFKKEENQSPAYALGEIDEA — APG- 



KNDANGEAASLKG 
YS-DKKKAMGR — TESTVPIFNMEDIEEGVEG — Y 

KKEKKRKV — KHSEASKQIHALENIEAGN — EG- 

SSDKKKSNKH — VDSSVPVFNLEDIEEGVEGAGF 



SEQ ID NO: 23 -KSCCGSRKKGKSS-KK YNYEKRRGINR — SDSNAPLFNMEDIDEGFEG — Y 

SEQ ID NO: 24 LCC--GLRKKSK T KAKDKKT — NTKETSKQIHALENVDEGVIVPV- 

SEQ ID NO:25 -CCCGGSRKKSKKKGEKKGLLGGLLYGKKKKMMGKNYVKKGSAPVFDLEEIEEGLEG Y 

SEQ ID NO: 26 NPDKLLEKKESETEALTTSDFDPDLDVTQLPKRFGNSTLL AESIPI 

f=S ™ "° s " "CPCFGRRRKNKK FSKNDMNGDVAALGG 

SEQ ID NO: 28 -SLCGGSRKKSSKSSKK GSDKKKSGKH — VDSTVPVFNLEDIEEGVEGAGF 

SEQ ID NO: 29 LCF-GSRKNRK AKT VAADKKK KNREASKQIHALENIEEGRGHKV- 



SEQ ID NO: 2 



901 960 



SEQ ID NO: 4 AENEKASIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 
SEQ ID NO: 6 ET 

SEQ ID NO: 8 ADIEKAGIVNQQKLEKKFGQSSVFVASTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 
SEQ ID NO: 10 AENEKAGIVNQQKLEKKFGQSSVFVTSTLLENGGTLKSASPASLLKEAIHVISCGYEDKT 
SEQ ID NO: 12 

SEQ ID NO: 14 MDDDKEVLMSQMNFEKKFGQSSIFVTSTLMEEGGVPPSSSPAALL^ 

SEQ ID NO: 16 DD- ERTLLMSQKSLEKRFGQS P VFI AAT FMEQGG I P PS TN PATLLKEAI HVI SCG YE DKT 

SEQ ID NO: 18 TNNEKTSNLTQTKLEKRFGQSPVFVASTLLDDGGVPHGVSPASLLKEAIQVISCGYEDKT 

SEQ ID NO: 20 DD-EKSVLMSQMSLEKRFGQSAAFVASTLMEYGGVPQSSTPESLLKEAIHVISCGYEDKS 

SEQ ID NO: 22 

SEQ ID NO: 23 DD-ERS I LMSQRSVEKRFGQSPVFIAAT FMEQGG I PPTTN PATLLKEAI HVI SCG YE DKT 

SEQ ID NO: 24 SNVEKRSEATQLKLEKKFGQSPVFVASAVLQNGGVPRNASPACLLREAIQVISCGYEDKT 

SEQ ID NO: 25 EELEKSTLMSQKNFEKRFGQSPVFIASTLMENGGLPEGTNSTSLIKEAIHVISCGYEEKT 

SEQ ID NO: 26 AE FQGRPLADHPAV- - KYGRPP GALR VPRDPLDATTVAES VSVI SCWYEDKT 

SEQ ID NO: 27 AEGDKEHLMFEMNFEKTFGQSS I FVTSTLMEEGGVPPSSS PAVLLKEAIHVISCGYEDKT 

SEQ ID NO: 28 DD-EKSLLMSQMSLEKRFGQSAVFVASTLMENGGVPQSATPETLLKEAIHVISCGYEDKT 

SEQ ID NO: 29 LNVEQS TEAMQMKLQKKYGQS PVFVAS ARLENGGMARN AS PACLLKEAI QVI S RG YE DKT 

961 1020 

SEQ ID NO: 2 HEDI LTGFKMH ARGWI S I YCMP PRPC FKGSAP I NLS DRLNQVLRWAL 

SEQ ID NO: 4 GWGKDIGWIYGSVTEDILTGFKMHCHGWRSIYCIPKRAAFKGSAPLNLSDRLHQVLRWAL 
EWGKEIGWIYGSVTEDILTGFKMHCRGWKSVYCTPTRPAFKGSAPINLSDRLHQVLRWAL 
DWGKEIGWIYGSITEDILTGFKMHCHGWRSIYCIPKRPAFKGSAPLNLSDRLHQVLRWAL 
DWGKEIGWI YGS VTEDI LTGFKMHCHGWRS I YCI PKRVAFKGSAPLNLSDRLHQVLRWAL 
SEQ ID NO: 12 WRP RRSL 

SEQ ID NO: 14 EWGLELGWI YGS ITEDILTGFKMHCRGWRS I YCMPKRAAFKGTAP INLS DRLNQVLRWAL 

SEQ ID NO: 16 EWGKEIGWIYGSVTEDILTGFKMHARGWISIYCMPPRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 18 EWGKEVGWI YGSVTEDI LTGFKMHCHGWRSVYCI PKRPAFKGSAPINLSDRLHQVLRWAL 

SEQ ID NO: 20 EWGTEIGWIYGSVTEDILTGFKMHARGWRSIYCMPKRPAFKGSAPINLSDRLNQVLRWAL 
SEQ ID NO: 22 

SEQ ID NO:23 EWGKE I GWIYGSVTEDILTGFKMHARGWIS I YCNPPRPAFKGSAP I NLS DRLNQVLRWAL 

" EWGKEIGWIYGSVTEDILTGFKMHCHGWRSVYCMPKRAAFKGSAPINLSDRLHQVLRWAL 

EWGKEIGWIYGSVTEDILTGFKMHCRGWKSVYCVPKRPAFKGSAPINLSDRLHQVLRWAL 

EWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDSFRGSAPINLTDRLHQVLRWAT 



SEQ ID NO: 6 
SEQ ID NO: 8 
SEQ ID NO: 10 



SEQ ID NO:24 
SEQ ID NO: 25 
SEQ ID NO:26 
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Figure 1 (cont'd.) 

SEQ ID NO: 27 EWGTELGWIYGSITEDILTGFKMHCRGWRSI YCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 28 DWGSEIGWIYGSVTEDILTGFKMHARGWRSI YCMPKRPAFKGSAPINLSDRLNQVLRWAL 

SEQ ID NO: 29 EWGKEIGWIYGSVTEDILTGSKMHSHGWRHVYCTPKLAAFKGSAPINLSDRLHQVLRWAL 

1021 1080 

SEQ ID NO: 2 GSVEILFSRHCPIWYNYGG-RLKLLERMAYINTIVYPITSLPLIAYCVLPAICLLTNKFI 

SEQ ID NO: 4 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 6 GSVEIFMSRHCPLWYAYGG-RLKWLERFAYTNTIVYPFTSIPLLAYCTIPAVCLLTGKFI 

SEQ ID NO: 8 GSVEIFFSKHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 10 GSIEIFFSNHCPLWYGYGGG-LKFLERFSYINSIVYPWTSIPLLAYCTLPAICLLTGKFI 

SEQ ID NO: 12 G CRL 

SEQ ID NO: 14 GSIEIFFSHHCPLWYGFKEKKLKWLERFAYANTTVYPFTSIPLVAYCILPAVCLLTDKFI 

SEQ ID NO: 16 GSIEIFLSRHCPLWYGYNG-KLKPLMRLAYINTIVYPFTSIPLIAYCTLPAFCLLTNKFI 

SEQ ID NO: 18 GSVEIFFSRHCPIWYGYGGG-LKLLERFSYINSWYPWTSLPLLVYCTLPAICLLTGKFI 

SEQ ID NO: 20 GSVEILFSRHCPLWYGYGG-RLKFLERFAYINTTIYPLTSLPLLVYCILPAICLLTGKFI 

SEQ ID NO: 22 

SEQ ID NO: 23 GSIEILLSRHCPIWYGYHG-RLRLLERIAYINTIVYPITSIPLIAYCILPAFCLITDRFI 

SEQ ID NO: 24 GSVEIFLSRHCPIWYGYGGG-LKWLERFSYINSVVYPWTSLPLIVYCSLPAVCLLTGKFI 

SEQ ID NO: 25 GSVEIFLSRHCPLWYGYGG-KLKWLERLAYINTIVYPFTSIPLLAYCTIPAVCLLTGKFI 

SEQ ID NO: 26 GSVEIFFSRNNAI LASKRLKFLQRLAYLNVGIYPFTSLFLILYCFLPAFSLFSGQFI 

SEQ ID NO: 27 GSVEIFFSRHSPLWYGYKGGKLKWLERFAYANTTIYPFTSIPLLAYCILPAICLLTDKFI 

SEQ ID NO: 28 GSVEILFSRHCPIWYGYSG-RLKWLERFAYVNTTIYPVTAIPLLMYCTLPAVCLLTNKFI 

SEQ ID NO: 29 GSVEIFLSRHCPIWYGYGGG-LKWLERLSYINSVVYPWTSLPLIVYCSLPAICLLTGKFI 

1081 114Q 

SEQ ID NO: 2 IPEISNYAGMFFILMFASIFATGILELRWSGVGIEDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 4 TPELNNVASLWFMSLFICIFATSILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFAVFQGL 

SEQ ID NO: 6 IPTLNNLASIWFIALFLSIIATSVLELRWSGVSIEDWWRNEQFWVIGGVSAHLFAVFQGF 

SEQ ID NO: 8 TPELTNVASIWFMALFICISVTGILEMRWSGVAIDDWWRNEQFWVIGGVSAHLFAVFQGL 

SEQ ID NO: 10 TPELNNVASLWFMSLFICIFATSILEMRWSGVGIDDWWRNEQFWVIGGVSSHLFAVFQGL 

SEQ ID NO: 12 

SEQ ID NO: 14 MPPISTFAGLYFVALFSSIIATGILELKWSGVSIEEWWRNEQFWVIGGVSAHLFAVIQGL 

SEQ ID NO: 16 I PEI SNFASMWFI LLFVS I FTTS I LELRWSGVS I EDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 18 VPEISNYASLVFMALFISIAATGILEMQWGGVSIDDWWRNEQFWVIGGVSSHLFALFQGL 

SEQ ID NO: 20 MPEISNLASIWFIALFLSIFATGILEMRWSGVGIDEWWRNEQFWVIGGISAHLFAVFQGL 

SEQ ID NO: 22 

SEQ ID NO: 23 I PEI SNYAS IWFILLFIS IAVTGILELRWSGVSIEDWWRNEQFWVIGGTSAHLFAVFQGL 

SEQ ID NO: 24 VPEISNYAGILFMLMFISIAVTGILEMQWGGVGIDDWWRNEQFWVIGGASSHLFALFQGL 

SEQ ID NO: 25 IPTLSNLTSVWFLALFLSIIATGVLELRWSGVSIQDWWRNEQFWVIGGVSAHLFAVFQGL 

SEQ ID NO: 26 VRTLS I S FLVYLLMIT ICLI GLAVLEVKWSGIGLEEWWRNEQWWLI SGTSSHLYAWQGV 

SEQ ID NO: 27 MPPISTFASLFFISLFMSIIVTGILELRWSGVSIEEWWRNEQFWVIGGISAHLFAVVQGL 

SEQ ID NO: 28 I PQI SNLAS IWFI SLFLS I FATGILKMKWNGVG IDQWWRNEQFWVI GGVSAHLFAVFQGL 

SEQ ID NO: 29 VPEISNYASILFMALFSSIAITGILEMQWGKVGIDDWWRNEQFWVIGGVSAHLFALFQGL 

1141 1200 

SEQ ID NO: 2 LKVLAGIDTNFTVTSKANDEDGD — FAELYVFKWTSLLIPPTTVLVINLVGMVAGISYAI 

SEQ ID NO: 4 LKVIAGVDTSFTVTSKGGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGWAGISNAI 

SEQ ID NO: 6 LKVLGGVDTSFTVTSKAAGDEADA-FGDLYLFKWTTLLVPPTTLIIINMVGIVAGVSDAV 

SEQ ID NO: 8 LKVFAGIDTSFTVTSKAGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGVVAGISNAI 

SEQ ID NO: 10 LKVIAGVDTSFTVTSKGGDD — EE-FSELYTFKWTTLLIPPTTLLLLNFIGWAGVSNAI 

SEQ ID NO: 12 

SEQ ID NO: 14 LKVLAGIDTNFTVTSKATDDE-E — FGELYTFKWTTLLIPPTTILIINIVGWAGISDAI 

SEQ ID NO: 16 LKVLAGI DTNFTVTSKASDEDGD — FAELYVFKWTSLLI PPTTVLIVNLVGIVAGVS YAI 

SEQ ID NO: 18 LKVLAGVNTNFTVTSKAADD — GE-FSELYIFKWTSLLIPPMTLLIMNIVGVWGISDAI 

SEQ ID NO: 20 LKVLAGI DTNFTVTSKANDEEGD— FAELYMFKWTTLLIPPTTIL J INMVGWAGTS YAI 

SEQ ID NO: 22 

SEQ ID NO: 23 LKVLAGI DTNFTVTSKATDEDGD— FAELY I FKWTALLIPPTTVLLVNLIGIVAGVSYAV 
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Figure 1 (cont'd.) 

SEQ ID NO:24 LKVLAGVNTNFTVTSKAADD — GA-FSELYI FKWTTLLI PPTTLLI INI IGVI VGVSDAI 

SEQ ID NO: 25 LKVLAGVDTNFTVTAKAADDTE FGELYLFKWTTLLIPPTTLIILNMVGVVAGVSDAI 

SEQ ID NO: 26 LKVIAGIEISFTLTTKSGGDDNEDIYADLYIVECWSSLMIPPIVIAMVNIIAIWAFIRTI 
SEQ ID NO: 27 LKILAGIDTNFTVTSKATDDD-D — FGELYAFKWTTLLIPPTTVLIINIVGWAGISDAI 
SEQ ID NO: 28 LKVLAGI DTNFTVTSKASDEDGD — FAELYMFKWTTLLI PPTTLLI INLVGWAGIS YVI 
SEQ ID NO: 29 LKVLAGVDTNFTVTSKAADD — GE-FSDLYLFKWTSLLIPPMTLLIINVIGVIVGVSDAI 

1201 126Q 
SEQ ID NO: 2 NSGYQSWGPLFGKLFFSIWVILHLYPFLKGLMGKQNRTPTIVIVWSILLASIFSLLWVKI 

SEQ ID NO: 4 NNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRI 

SEQ ID NO: 6 NNGYGSWGPLFGKLFFSFWVIVHLYPFLKGLMGRQNRTPTIWLWSILLASIFSLVWVRI 

SEQ ID NO: 8 NNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRV 

SEQ ID NO: 10 NNGYESWGPLFGKLFFAFWVIVHLYPFLKGLVGRQNRTPTIVIVWSILLASIFSLLWVRI 

SEQ ID NO: 12 LGEDER LWS rm 

SEQ ID NO: 14 NNGYQSWGPLFGKLFFSFWVIVHLYPFLKGLMGRQNRTPTIWIWSVLLASIFSLLWVRI 

SEQ ID NO: 16 NSGYQSWGPLFGKLFFAIWVIAHLYPFLKGLLGRQNRTPTIVIVWSVLLASIFSLLWVRI 

SEQ ID NO: 18 NNGYDSWGPLFGRLFFALWVILHLYPFLKGLLGKQDRMPTIILVWSILLASILTLMWVRI 

SEQ ID NO ! 22 * SGYQSWGPLFGKLFFAFWIVHL ^^ 

SEQ ID NO:23 NSGYQSWGPLFGKLFFALWVIAHLYPFLKGLLGRQNRTPTIVIVWSVLLASIFSLLWRI 

SEQ ID NO: 24 SNGYDSWGPLFGRLFFALWVIVHLYPFLKGMLGKQDKMPTIIVVWSILLASILTLLWVRV 

SEQ ID NO: 25 NNGYGSWGPLFGKLFFAFWVILHLYPFLKGLMGRQNRTPTIVVLWSILLASIFSLVWVRI 

SEQ ID NO: 26 YQAVPQWSKLIGGAFFSFWVLAHLYPFAKGLMGRRGKTPTIVFVWAGLIAITISLLWTAI 

SEQ ID NO: 27 NNGYQSWGPLFGKLFFS EWVIVHLYPFLKGLMGRQNRTPTIVVIWSVLLASIFSLLWVRI 

SEQ ID NO: 28 NSGYQSWGPLFGKLFFAFWVIIHLYPFLKGLMGRQNRTPTIWVWSILLASIFSLLWVRI 

SEQ ID NO: 29 SNGYDSWGPLFGRLFFALWVIIHLYPFLKGLLGKQDRMPTIIWWSILLASILTLLWVRV 

1261 1284 

SEQ ID NO: 2 DPFISDTQKAVAM-GQCGVNC 

SEQ ID NO: 4 DPFLAKDDGPLL — EECGLDCN — 

SEQ ID NO: 6 DPFIPKAKGPILKP-C-GVEC 

SEQ ID NO: 8 DPFLAKSNGPLL — EECGLDCN — 

SEQ ID NO: 10 DPFLAKDDGPLL— EECGLDCN— 

SEQ ID NO: 12 KQMVILSGPR 

SEQ ID NO: 14 DPFVLKTKGPDTKL— CGINC 

SEQ ID NO: 16 DPFTSDSNKLT — NGQCGINC 

SEQ ID NO: 18 NPFVSRD-GPVL — EICGLNCDES 

SEQ ID NO: 20 DPFTTRLAGPNI — QTCGINC 

SEQ ID NO: 22 

SEQ ID NO: 23 NPFVDANPNANNFNGKGGVF 

SEQ ID NO: 24 NPFVAKG-GPVL — EICGLNCGN- 

SEQ ID NO: 25 DPFLPKQTGPVLKQ-C-GVEC 

SEQ ID NO: 26 NP NTGPAAAAEGVGGGGFQFP 

SEQ ID NO: 27 DPFVLKTKGPDTSK — CGINC 

SEQ ID NO: 28 DPFTTRVTGPDV — EQCGINC 

SEQ ID NO: 29 NPFVAKG-GPIL — EICGLDC — L 
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SEQUENCE LISTING 

<110> E. I. du Pont de Nemours and Company 

<120> Plant Cellulose Synthases 

<130> BB-1170 

<140> 
<141> 

<150> 60/092,844 

<151> July 14, 1998 

<160> 29 



<170> Microsoft Office 97 

<210> 1 

<211> 1221 

<212> DNA 

<213> Hordeum vulaare 



<400> 1 

gcacgaggat 

catgccacca 

tcaagttctc 

ctggtacaat 

tgtttatcca 

cctcaccaac 

tatgtttgcc 

cgaggactgg 

cgcagtgttc 

ctcgaaggca 

tctcctcatt 

atcatatgcc 

ctcaatctgg 

ccgcacgcca 

gtgggtgaag 

tggcgtcaac 

gggttggatg 

cccttcgtgc 

ttttttctcc 

cagaacgtat 

tgcatcaact 



attcttactg 

cgaccttgtt 

cggtgggctc 

tacggtgggc 

ataacatccc 

aaatttatca 

tccatctttg 

tggagaaacg 

cagggtctgc 

aacgacgagg 

cctccgacca 

atcaacagcg 

gtgatcctcc 

accatcgtca 

atcgaccctt 

tgctgatcgg 

ggattttgtt 

tgagaagcac 

ttttcttttc 

ctatagttct 

cttctgcaaa 



ggtttaaaat 
tcaagggttc 
ttgggtcagt 
ggttgaaact 
ttccacttat 
ttcccgagat 
ccacgggtat 
agcagttctg 
tgaaggtgtt 
atggcgattt 
ccgtccttgt 
gttaccagtc 
atctctaccc 
ttgtttggtc 
tcatatccga 
cgccgaagag 
gttgtagatg 
aaaccgtgaa 
atctgtgata 
atacacacta 
a 



gcacgcaaga 
tgcgccaatc 
tgaaattctg 
tctggagagg 
cgcctattgt 
cagtaactat 
attggagctg 
ggttattggt 
ggccgggatt 
tgctgagtta 
gattaacctg 
ttggggtcca 
cttcctcaag 
catcctccta 
tacccagaaa 
tatctgcccc 
aagacggagt 
gcctacgaaa 
cctgttgttt 
tgacaccaac 



ggttggatat 
aatctctctg 
tttagcagac 
atggcttaca 
gtgcttcctg 
gctgggatgt 
cgatggagtg 
ggcacatctg 
gacaccaact 
tacgtgttca 
gtgggcatgg 
ctcttcggaa 
ggtctcatgg 
gcctccatct 
gccgtcgcca 
cctcgtgtaa 
tttatgtaag 
cctgcagcgt 
cttcttagag 
tatttatata 



caatctactg 

accgtctcaa 

attgtcctat 

tcaacaccat 

ctatctgtct 

tctttattct 

gtgtcggcat 

cccatctttt 

tcacggttac 

agtggaccag 

tggcaggcat 

agctcttctt 

ggaagcagaa 

tctccctcct 

tggggcagtg 

ataccggagg 

ttattattgc 

acattgtgat 

tatattatgt 

aggcagctgt 



<210> 2 

<211> 304 

<212> PRT 

<213> Hordeum vulgare 

<400> 2 

His Glu Asp He Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp lie 
1 5 10 15 

Ser lie Tyr Cys Met Pro Pro Arg Pro Cys Phe Lys Gly Ser Ala Pro 
20 25 30 

lie Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Tro Ala Leu Gly 
35 40 45 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1221 
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Ser Val Glu He Leu Phe Ser Arg His Cys Pro He Trp Tyr Asn Tvr 
50 55 60 

Gly Gly Arg Leu Lys Leu Leu Glu Arg Met Ala Tyr He Asn Thr He 
65 70 75 80 

Val Tyr Pro He Thr Ser Leu Pro Leu He Ala Tyr Cys Val Leu Pro 
85 90 95 

Ala He Cys Leu Leu Thr Asn Lys Phe He He Pro Glu He Ser Asn 
100 105 no 

Tyr Ala Gly Met Phe Phe He Leu Met Phe Ala Ser He Phe Ala Thr 
115 120 125 

Gly He Leu Glu Leu Arg Trp Ser Gly Val Gly He Glu Asp Trp Tro 
130 135 HO 

Arg Asn Glu Gin Phe Trp Val He Gly Gly Thr Ser Ala His Leu Phe 
145 150 155 160 

Ala Val Phe Gin Gly Leu Leu Lys Val Leu Ala Gly He Asd Thr Asn 
165 170 * 175 

Phe Thr Val Thr Ser Lys Ala Asn Asp Glu Asp Gly Asp Phe Ala Glu 
180 185 190 

Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu He Pro Pro Thr Thr Val 
195 200 205 

Leu Val He Asn Leu Val Gly Met Val Ala Gly He Ser Tyr Ala He 
210 215 220 

Asn Ser Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe 
225 230 235 240 

Ser He Trp Val He Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met 
245 250 ' 255 

Gly Lys Gin Asn Arg Thr Pro Thr lie Val He Val Trp Ser lie Leu 
260 265 270 

Leu Ala Ser He Phe Ser Leu Leu Trp Val Lys He Asp Pro Phe He 
275 280 285 

Ser Asp Thr Gin Lys Ala Val Ala Met Gly Gin Cys Gly Val Asn Cys 
290 295 _ 300 

<210> 3 

<211> 3776 

<212> DNA 

<213> Zea mays 

<400> 3 

gcgcgccgcg caggcgcaac gcaacaaagg gaaaccccag ccggaggagc aaaagctagc 60 

aagcgtgtcc ctccccctcc ctcactcccg tttcattcca ttccccccca gacgccgcta 120 

ccgccgccgc cgcacgcacg cttgccccgg gatctggaga tctggtagcg ccagggggat 180 

ggaggccagc gccgggctgg tcgccggctc gcacaaccgg aacgagctcg tcgtcatccg 240 

ccgcgatggc gagccagggc cgaagcccat ggaccagcgg aacggccagg tgtgccagat 300 
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ttgcggcgac gacgtggggc gcaaccccga cggggagccg ttcgtggcct gcaacgagtg 360 
cgccttcccc atctgccggg actgctacga gtacgagcgc cgcgagggca cgcagaactg 420 
cccccagtgc aagacccgct tcaagcgcct caaggggtgc gcgcgcgtgc ccggggacga 480 
ggaggaggac ggcgtcgacg acctggagaa cgagttcaac tggagcgaca agcacgactc 540 
ccagtacctc gccgagtcca tgctccacgc ccacatgagc tacggccgcg gcgccgacct 600 
cgacggcgtg ccgcagccat tccaccccat ccccaatgtt cccctcctca ccaacggaca 660 
gatggtcgat gacatcccgc cggaccagca cgcccttgtg ccctcgttcg tgggtggcgg 720 
ggggaagagg attcaccctc tcccgtacgc ggatcccaac cttcctgtgc aaccgaggtc 780 
tatggaccct tccaaggatc tcgccgcata tggctacggg agcgtagcat ggaaggagag 840 
gatggagagc tggaagcaga agcaggagag gatgcaccag acgaggaacg atggcggcgg 900 
cgatgatggt gatgatgcag atctaccact aatggatgaa gctagacagc cattgtccag 960 
aaagatcccg cttccttcaa gccaaatcaa cccctatagg atgattataa taattcggct 1020 
agtggttttg tgtttcttct tccactaccg agtgatgcat ccggtgcctg atgcatttgc 1080 
tttatggctc atatctgtga tctgtgaaat ttggtttgcc atgtcttgga ttcttgacca 1140 
gtttccaaag tggtttccta tcgagaggga aacctatctt gaccggctga gtttaaggtt 1200 
tgacaaggaa gggcatcctt ctcaactcgc ccctgttgat ttctttgtca gtacggttga 1260 
tcccttgaag gaacctccat tggtcactgc taatactgtt ctatctatcc tttcggtgga 1320 
ttatccagtt gataaggttt catgctacgt ttctgatgat ggtgctgcca tgctgacatt 1380 
tgaagcattg tctgaaacat ctgaatttgc aaagaaatgg gttcctttct gcaaaagata 1440 
tagccttgag cctcgtgctc cagagtggta cttccaacag aagatagact acctgaaaga 1500 
caaggtggcg ccaaactttg ttagagaacg gagagcaatg aagagagagt atgaggaatt 1560 
caaggtcaga atcaatgcct tggttgctaa agcccaaaag gttcctgaag aaggatggac 1620 
aatgcaggat ggaactccat ggcccggaaa taatgtccgt gatcaccctg gaatgattca 1680 
ggttttcctr ggtcaaagtg gtggccatga tgtggaagga aatgagctgc ctcgattggt 1740 
ttatgtttca agagaaaaac ggccaggcta caaccatcac aagaaggctg gtgctatgaa 1800 
tgcattggtc cgagtctctg ctgtactaac taatgctcct tatttgctga acttggattg 1860 
tgatcactat atcaataata gtaaggctat aaaggaagca atgtgtttta tgatggatcc 1920 
tttgcttgga aagaaagttt gctatgtgca gtttcctcaa agatttgatg ggattgatcg 1980 
ccatgatcga tatgctaaca gaaatgttgt ctttttcgat atcaacataa aaggtttgga 2040 
tggtatccag ggcccaattt atgtgggtac tggatgtgtc ttcagaaggc aggcattata 2100 
tggctacgat gctcccaaaa caaagaagcc accatcaaga acttgcaact gctggccaaa 2160 
gtggtgcatt tgctgttgct gttttggtaa caggaagacc aagaagaaga ccaagacctc 2220 
taaacctaaa tttgagaaga taaagaaact ttttaagaaa aaggaaaatc aagcccctgc 2280 
atatgctctt ggtgaaattg atgaagccgc tccaggagct gaaaatgaaa aggctagtat 2340 
tgtaaatcaa cagaagttgg aaaagaaatt tggccagtct tcagtttttg ttgcatccac 2400 
acttcttgag aatggtggaa ccctgaagag tgccagtcca gcttctcttc tgaagaaagc 24 60 
tatacatgtc atcagttgtg gatatgaaga caaaacaggc tggggaaaag atattggttg 2520 
gatttatgga ccagtcacag aagatattct tactgggttt aagatgcact gccatggttg 2580 
gcggtcaatt tactgcatac ctaaacgggc cgccttcaaa ggttccgcac ctctcaatct 2640 
ttccgatcgt cttcaccagg ttcttcggtg ggctcttggt tcaatrgaaa ttttcttcag 2700 
caaccactgc cctctctggt atgggtatgg tggtggacta aagttcctgg aaaggttttc 2760 
gtacattaac tccatcgtat acccttggac atctatcccg ctcttggcct attgcacatt 2820 
gcctgccatc tgcttgctga cagggaaatt tatcacgcca gagcttaaca atgttgccag 2880 
cctctggttc atgtcacttt tcatctgcat ttttgctacg agcatcctgg aaatgagatg 2940 
gagtggtgta ggcatcgatg actggtggag aaacgagcag ttttgggtca ttggaggcgt 3000 
gtcttcacat ctctttgctg tgttccaggg actcctcaag gtcatagctg gtgtagacac 3060 
gagcttcact gtgacatcca agggcggaga cgacgaggag ttctcagagc tgtacacatt 3120 
caaatggacg acccttctga tacctccgac aaccctgctc ctactgaact tcattggagt 3180 
ggtagctggc atctccaatg cgatcaacaa cggatatgaa tcatggggcc ccctgttcgg 324 0 
gaagctcttc tttgcatttt gggtgatcgt ccatctttac ccgttcctca agggtctggt 3300 
tgggaggcag aacaggacgc caacgattgt cattgtctgg tccatcctcc tggcttcgat 3360 
cttctcgctg ctttgggtcc ggatcgaccc gttccttgcg aaggatgatg gtcccctgtt 3420 
ggaggagtgt ggtctggatt gcaactagga ggtcagcacg tggacttccc cgtcagtgtg 3480 
tggtcgaaga agtatttttg cagatgtttt gtgcccatat ttcttttttc aatttttgtc 3540 
cctctgtaga tagaaacaag gggagaaggg gaaaaaaagt acttgtattt cttttgttcc 3600 
atggtggtgg tggtggtggg cggctcagcc tcgtgagtgc agtattgggc aaaccggagg 3660 
ctgcggcaac cttgtgcagt tcggccacga atatactagg gaagatcgcg accaatcaat 3720 
caatcgatga ccgagttcaa ttgttcagca aaaaaaaaaa aaaaaaaaaa aaaaaa 3776 
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<210> 4 

<211> 1148 

<212> PRT 

<213> Zea mays 

<400> 4 

Arg Ala Ala Gin Ala Gin Arg Asn Lys Gly Lys Pro Gin Pro Glu Glu 
1 5 io * 15 

Gin Lys Leu Ala Ser Val Ser Leu Pro Leu Pro His Ser Arg Phe He 
20 25 30 

Pro Phe Pro Pro Arg Arg Arg Tyr Arg Arg Arg Arg Thr His Ala Cys 
35 40 45 

Pro Gly He Trp Arg Ser Gly Ser Ala Arg Gly Met Glu Ala Ser Ala 
50 55 60 

Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu Leu Val Val He Arg 
65 70 75 80 

Arg Asp Gly Glu Pro Gly Pro Lys Pro Met Asp Gin Arg Asn Gly Gin 
85 90 95 

Val Cys Gin He Cys Gly Asp Asp Val Glv Arg Asn Pro Asp Gly Glu 
100 105 no 

Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro He Cys Arg Asp Cys 
U5 120 125 

Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gin Asn Cys Pro Gin Cys Lys 
130 135 14 0 

Thr Arg Phe Lys Arg Leu Lys Gly Cys Ala Arg Val Pro Gly Asp Glu 
145 150 155 160 

Glu Glu Asp Gly Val Asp Asp Leu Glu Asn Glu Phe Asn Trp Ser Asp 
165 170 175 

Lys His Asp Ser Gin Tyr Leu Ala Glu Ser Met Leu His Ala His Met 
180 185 190 

Ser Tyr Gly Arg Gly Ala Asp Leu Asp Gly Val Pro Gin Pro Phe His 
195 200 205 

Pro He Pro Asn Val Pro Leu Leu Thr Asn Gly Gin Met Val Asp Asp 
210 215 220 

He Pro Pro Asp Gin His Ala Leu Val Pro Ser Phe Val Glv Gly Gly 
225 230 235 - 240 

Gly Lys Arg He His Pro Leu Pro Tyr Ala Asp Pro Asn Leu Pro Val 
245 250 255 

Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr 
260 265 270 

Gly Ser Val Ala Trp Lys Glu Arg Met Glu Ser Trp Lys Gin Lys Gin 
275 280 285 



4 



WO 00/04166 



PCT/US99/15871 



Glu Arg Met His Gin Thr Arg Asn Asp Gly Gly Gly Asd Asp Gly Asp 
290 295 300 ' 

Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro Leu Ser Ara 
305 310 315 320 

Lys He Pro Leu Pro Ser Ser Gin He Asn Pro Tyr Arg Met He He 
325 330 ~ 335 

lie lie Arg Leu Val Val Leu Cys Phe Phe Phe His Tyr Arg Val Met 
340 345 350 

His Pro Val Pro Asp Ala Phe Ala Leu Trp Leu He Ser Val lie Cys 
355 360 365 

Glu lie Trp Phe Ala Met Ser Trp lie Leu Asp Gin Phe Pro Lys Trp 
370 375 380 

Phe Pro lie Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Ara Phe 
385 390 395 400 

Asp Lys Glu Gly His Pro Ser Gin Leu Ala Pro Val Asd Phe Phe Val 
^05 410 4 i5 

Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ala Psn Thr 
420 425 43 0 

Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Asp Lys Val Ser Cys 
435 440 445 

Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser 
450 455 460 

Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Cys Lys Arg Tyr 
465 470 475 480 

Ser Leu Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin Lys He Asp 
485 490 495 

Tyr Leu Lys Asp Lys Val Ala Pro Asn Phe Val Arg Giu Arg Arg Ala 
500 505 ' 510 

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val 
515 520 525 

Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met Gin Asd Glv 
530 535 540 

Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly Met lie Gin 
545 550 555 " 560 

Val Phe Leu Gly Gin Ser Gly Gly His Asp Val Glu Gly Asn Glu Leu 
565 570 ' 575 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Tyr Asn His 
580 585 590 



His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ma Val 
595 600 6 05 
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Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cys Asp His Tyr He 
610 615 620 

Asn Asn Ser Lys Ala He Lys Glu Ala Met Cys Phe Met Met Asp Pro 
625 630 635 6 40 

Leu Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
645 650 655 

Gly lie Asp Arg His Asp Arg Tyr Ala Asn Arg Asn Val Val Phe Phe 
660 665 670 

Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro lie Tyr Val 
675 680 685 

Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Asp Ala 
690 695 700 

Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys Trp Pro Lys 
705 7 *0 715 720 

Trp Cys lie Cys Cys Cys Cys Phe Gly Asn Arg Lys Thr Lys Lys Lvs 
725 730 735 " 

Thr Lys Thr Ser Lys Pro Lys Phe Glu Lys lie Lys Lys Leu Phe Lys 
740 745 75Q 

Lys Lys Glu Asn Gin Ala Pro Ala Tyr Ala Leu Gly Glu He Asp Glu 
755 760 765 

Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Ser He Val Asn Gin Gin 
770 775 780 

Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Ala Ser Thr 
785 790 795 800 

Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser Leu 
805 810 815 

Leu Lys Glu Ala He His Val lie Ser Cys Gly Tyr Glu Asp Lvs Thr 
820 825 830 

Gly Trp Gly Lys Asp He Gly Trp He Tyr Gly Ser Val Thr Glu Asp 
835 840 845 

He Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arg Ser He Tvr 
850 855 860 



Cys He Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro Leu Asn Leu 
865 870 875 880 

Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser He Glu 
885 890 895 

He Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Glv 
900 905 910 

Leu Lys Phe Leu Glu Arg Phe Ser Tyr He Asn Ser He Val Tyr Pro 
915 920 925 
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Trp Thr Ser lie Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala He Cys 
930 935 940 

Leu Leu Thr Gly Lys Phe He Thr Pro Glu Leu Asn Asn Val Ala Ser 
945 950 955 960 

Leu Trp Phe Met Ser Leu Phe He Cys He Phe Ala Thr Ser He Leu 
965 970 975 

Glu Met Arg Trp Ser Gly Val Gly He Asp Asp Trp Trp Arg Asn Glu 
980 985 990 

Gin Phe Trp Val He Gly Gly Val Ser Ser His Leu Phe Ala Val Phe 
995 1000 1005 

Gin Gly Leu Leu Lys Val He Ala Gly Val Asp Thr Ser Phe Thr Val 
1010 1015 1020 

Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr Phe 
1025 1030 103 5 1040 

Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu Leu Asn 
1045 1050 1055 

Phe He Gly Val Val Ala Gly He Ser Asn Ala lie Asn Asn Gly Tyr 
1060 1065 1070 

Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val 
1075 1080 1085 

He Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg Gin Asn 
1090 1095 noo 

Arg Thr Pro Thr He Val He Val Trp Ser He Leu Leu Ala Ser He 
1105 1110 ins H20 

Phe Ser Leu Leu Trp Val Arg He Asp Pro Phe Leu Ala Lys Asd Asp 
1125 H30 

Gly Pro Leu Leu Glu Glu Cvs Gly Leu Asd Cys Asn 
1140 H45 



<210> 5 

<211> 1189 

<212> DNA 

<213> Zea mays 

<400> 5 

gcacgagacc gagtggggca aggagattgg 
cctgacgggg ttcaagatgc actgccgggg 
gccggcgttc aaggggtcgg cgcccatcaa 
ctgggcgctg gggtccgtgg agatcttcat 
cggcggccgg ctcaagtggc tggagcgctt 
cacctccatc ccgctcctcg cctactgcac 
gttcatcatt cccacgctga acaacctcgc 
catcatcgcg acgagcgtcc tggagctgcg 
gcgcaacgag cagttctggg tcatcggcgg 
gggcttcctc aaggttctgg gcggcgtgga 
cggcgacgag gccgacgcct tcggggacct 
gccccccacc acgctcatca tcatcaacat 



gtggatctat gggtcggtga cagaggatat 60 

gtggaagtcc gtgtactgca cgccgacacg 120 

cttgtctgat cgtctccacc aggtgctgcg 180 

gagccgccac tgcccgctct ggtacgccta 240 

cgcctacacc aacaccatcg tgtacccctt 300 

catccccgcc gtctgcctgc tcaccggcaa 360 

cagcatctgg ttcatcgcgc tcttcctgtc 420 

gtggagcggg gtgagcatcg aggactggtg 4 80 

cgtgtccgcg catctcttcg ccgtgttcca 540 

caccagcttc accgtcacct ccaaggcggc 600 

ctacctcttc aagtggacca ccctgctggt 660 

ggtgggcatc gtggccggcg tgtccgacgc 720 
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cgtcaacaac ggctacggct cctggggccc gctcttcggc aagctcttct tctccttctg 780 

ggtcatcgtc cacctctacc cgttcctcaa ggggctcatg gggaggcaga accggacgcc 840 

caccatcgtc gtgctctggt ccatcctcct cgcctccatc ttctcgctcg tctgggtcag 900 

gatcgacccg tttatcccga aggccaaggg ccccatcctc aagccatgcg gagtcgagtg 960 

ctgagctcac ctagctacct tcttgttgca tgtacggacg ccgccgtgcg tttggacata 1020 

caggcacttt tgggccaggc tactcatgtt cgactttttt tttaattttg tacaagattt 1080 

gtgatcgagt gactgagtga gacagagtgt tgggtgtaag aactgtgatg gaattcactc 1140 

aaattaatgg acattttttt tcttcaactg caaaaaaaaa aaaaaaaaa H89 

<210> 6 

<211> 320 

<212> PRT 

<213> Zea mays 

<400> 6 

His Glu Thr Glu Trp Gly Lys Glu lie- Gly Trp lie Tyr Gly Ser Val 
1 5 10 is 

Thr Glu Asp lie Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lys 
20 25 30 

Ser Val Tyr Cys Thr Pro Thr Arg Pro Ala Phe Lys Gly Ser Ala Pro 
35 40 45 

He Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly 
50 55 60 

Ser Val Glu He Phe Met Ser Arg His Cys Pro Leu Trp Tyr Ala Tyr 
65 70 75 go 

Gly Gly Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Thr Asn Thr He 
85 90 95 

Val Tyr Pro Phe Thr Ser lie Pro Leu Leu Ala Tyr Cys Thr He Pro 
100 105 no 

Ala Val Cys Leu Leu Thr Gly Lys Phe He He Pro Thr Leu Asn Asn 
H5 120 125 

Leu Ala Ser He Trp Phe He Ala Leu Phe Leu Ser He He Ala Thr 
130 135 140 

Ser Val Leu Glu Leu Arg Trp Ser Gly Val Ser He Glu Asp Trp Trp 
145 150 155 ^ 160 

Arg Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe 
165 170 175 

Ala Val Phe Gin Gly Phe Leu Lys Val Leu Gly Gly Val Asp Thr Ser 
180 185 190 

Phe Thr Val Thr Ser Lys Ala Ala Gly Asp Glu Ala Asp Ala Phe Gly 
195 200 205 

Asp Leu Tyr Leu Phe Lys Trp Thr Thr Leu Leu Val Pro Pro Thr Thr 
210 215 220 

Leu He He He Asn Met Val Gly He Val Ala Gly Val Ser Asp Ala 
225 2 30 235 240 
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Val Asn Asn Gly Tyr Gly Ser Trp 
24 5 

Phe Ser Phe Trp Val He Val His 
260 

Met Gly Arg Gin Asn Arg Thr Pro 
275 280 

Leu Leu Ala Ser He Phe Ser Leu 
290 295 



Gly Pro Leu Phe Gly Lys Leu Phe 
250 255 

Leu Tyr Pro Phe Leu Lys Gly Leu 
265 270 

Thr He Val Val Leu Trp Ser He 
285 

Val Trp Val Arg He Asp Pro Phe 
300 



He Pro Lys Ala Lys Gly Pro lie Leu Lys Pro Cys Gly Val Glu Cvs 
305 310 315 320 

<210> 7 

<211> 3786 

<212> DNA 

<213> Zea mays 



<400> 7 

ccacagctca tataccaaga gccggagcag 
agcacaaccc ccacccgcca cagccgcgtg 
ccagaggagg ggaggactac gtgcatttcg 
gcgagatccg gcggggcggg gcggggggcc 
cggctcgcat aaccggaacg agctggtggt 
gggcggcggc gcggcgcgcc gggcggaggc 
ggtgggcttc gacggggagc ccttcgtggc 
cgcctgctac gagtacgagc gccgcgaggg 
ctacaagcgc ctcaagggct gcccgcgggt 
cgacctggag ggcgagttcg gcctgcagga 
cgtcgccgag tccatgctca gggcgcagat 
cggcttcagc cccgtcccca acgtgccgct 
cccgccggag cagcacgcgc tcgtgccgtc 
gaggatccac ccgctccctt tcgcagatcc 
cccgtccaag gatctggccg cctacggata 
gggctggaag cagaagcagg agcgcctgca 
ggatggcgac gatgcagatc tgccactaat 
agtccctata tcatcaagcc gaattaaccc 
ggttttgggt ttcttcttcc actaccgagt 
gtggctcata tctgtaatct gtgaaatctg 
cccaaagtgg cttccaatcg agagagagac 
caaggaaggt caaccctctc agcttgctcc 
cacaaaggaa cctcccttgg tcacagcgaa 
tccggttgag aaggtctcct gctatgtttc 
agcattgtct gaaacatctg aatttgcaaa 
tatcgagcct cgtgctcctg agtggtactt 
ggttgctgct tcatttgtta gggagaggag 
ggtaaggatc aatgccttgg ttgcaaaagc 
gcaagatgga agcccctggc ctggaaacaa 
attccttggc caaagtggcg gtcgtgatgt 
tgtctcgaga gaaaagaggc caggttataa 
actggtccgt gtctctgctg tcttatcaaa 
tcactacatc aacaatagca aggccataaa 
ggtggggaag aaagtgtgct atgtacagtt 
tgatcgatac gctaacagga acgttgtctt 
tattcaagga cccatttatg tgggtactgg 
ttatgatgct cctaaaacga agaagccacc 
gtgcctctct tgctgctgca gcaggaacaa 
ggagaagaag aaaagattat ttrtcaagaa 



Cutagcgcag cccagagcgg cgccgcgcca 60 
cgcatgtgag cggtcgccgc ggccgggaga 120 
ctgtgccgcc gccgcggggt tcgtgcgcga 180 
tgagatggag gctagcgcgg ggctggtggc 240 
gatccgccgc gaccgcgagt cgggagccgc 300 
gccgtgccag atatgcggcg acgaggtcgg 360 
gtgcaacgag tgcgccttcc ccgtctgccg 420 
ctcgcaagcg tgcccgcagt gcaggacccg 4 80 
ggccggcgac gaggaggagg acggcgtcga 540 
cggcgccgcc cacgaggacg acccgcagta 600 
gagctacggc cgcggcggcg acgcgcaccc 660 
cctcaccaac ggccagatgg ttgatgacat 720 
ctacatgagc ggcggcggcg gcgggggcaa 780 
caaccttcca gtgcaaccga gatccatgga 840 
tggcagcgtg gcctggaagg agagaatgga 900 
gcatgtcagg agcgagggtg gcggtgattg 960 
ggatgaagct aggcagccat tgtccagaaa 1020 
ctacaggatg attatcgtta tccggttggt 1080 
gatgcatccg gcgaaagatg catttgcatt 1140 
gtttgcgatg tcctggattc ttgatcagtt 1200 
ttacctggac cgtttgtcac taaggtttga 1260 
aatcgacttc tttgtcagta cggttgatcc 1320 
cactgtcctt tccatccttt ctgtggatta 1380 
tgatgatggt gctgcaatgc ttacgtttga 1440 
gaaatgggtt cctttcagca aaaagtttaa 1500 
ccaacagaag atagactacc tgaaagacaa 1560 
ggcgatgaag agagaatacg aggaattcaa 1620 
ccaaaaggtt cctgaggaag gatggacaat 1680 
cgtacgcgat catcctggaa tgattcaggt 1740 
ggaaggaaat gagttgcctc gcctggttta 1800 
ccatcacaag aaggctggtg ccatgaatgc 1860 
tgctgcatac ctattgaact tggactgtga 1920 
agaggctatg tgtttcatga tggatccttt 1980 
ccctcagagg tttgatggta ttgacaaaaa 2040 
ttttgacatc aacatgaaag gtttggacgg 2100 
atgtgttttc agacggcagg cactgtatgg 2160 
atcaagaact tgcaactgct ggcccaagtg 2220 
gaataaaaag aagactacaa aaccaaagac 2280 
agcagaaaac ccatctcctg catatgcttt 234 0 
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gggtgaaatt gatgaaggtg ctccaggtgc tgatatcgag aaggccggaa tcgtaaatca 2400 
acagaaacta gagaagaaat ttgggcagtc ttctgttttt gtcgcatcaa cacttcttga 2460 
gaacggaggg accctgaaga gcgcaagtcc agcttctctt ctgaaggaag ctatacatgt 2520 
tatcagctgc ggctacgaag acaagaccga ctggggaaaa gagattggct ggatttacgg 2580 
atcgatcaca gaggatatct tgactggatt taagatgcac tgccatggct ggcggtctat 2640 
ttactgcatc ccgaagcggc ctgcattcaa aggttctgcg cctctgaacc tttccgaccg 2700 
tcttcaccag gtccttcgct gggcccttgg gtccgtcgaa attttcttca gcaagcactg 2760 
cccactttgg tacggatacg gcggcgggct aaaattcctg gaaaggtttt cttatatcaa 2820 
ctccatcgtt tatccctgga cgtccattcc tctcctggct tactgtacct tgcctgccat 2880 
ctgcctgctc acggggaagt ttatcacacc agagcttacc aatgtcgcca gtatctggtt 2940 
catggcactt ttcatctgca tctccgtgac cggcatcctg gaaatgaggt ggagtggcgt 3000 
ggccatcgac gactggtgga ggaacgagca gttctgggtc atcggaggcg tttcggcgca 3060 
tctgttcgcg gtgttccagg gcctgctgaa ggtgttcgcc ggcatcgaca cgagcttcac 3120 
cgtgacgtcg aaggccgggg acgacgagga gttctcggag ctgtacacgt tcaagtggac 3180 
caccctgctg atacccccga ccacgctcct cctgctgaac ttcatcgggg tggtggccgg 3240 
gatctcgaac gcgatcaaca acgggtacga gtcgtggggc cccctgttcg ggaagctctt 3300 
cttcgccttc tgggtgatcg tccacctgta cccgttcctc aagggtctgg tggggaggca 3360 
gaacaggacg ccgacgatcg tcatcgtctg gtccatcctg ctggcctcga tcttctcgct 3420 
cctgtgggtc cgcgtcgacc cgttcctcgc caagagcaac ggcccgctcc tggaggagtg 34 80 
tggcctggac tgcaactgaa gtgggggccc cctgtcactc gaagttctgt cacgggcgaa 354 0 
ttacgcctga ttttttgttg ttgttgttgt tggaattctt tgctgtagat agaaaccaca 3600 
tgtccacggc atctctgctg tgtccattgg agcaggagag aggtgcctgc tgctgtttgt 3660 
tgagtaaatt aaaagtttta aagttataca gtgatgcaca ttccagtgcc cagtgtattc 3720 
cctttttaca gtctgtatat tagcgacaaa ggacatattg gttaggagtt tgattctttt 3780 
gtaaaa ' 3786 

<210> 8 

<211> 1165 

<212> PRT 

<213> Zea mays 

<400> 8 

His Ser Ser Tyr Thr Lys Ser Arg Ser Ser Leu Ala Gin Pro Arg Ala 
1 5 10 is 

Ala Pro Arg Gin Ala Gin Pro Pro Pro Ala Thr Ala Ala Cys Ala Cys 
20 25 30 

Glu Arg Ser Pro Arg Pro Gly Asp Gin Arg Arg Glv Gly Leu Arg Ala 
35 40 " 45 

Phe Arg Cys Ala Ala Ala Ala Gly Phe Val Arg Glu Arg Asp Pro Ala 
50 55 60 

Gly Arg Gly Gly Gly Pro Glu Met Glu Ala Ser Ala Gly Leu Val Ala 
65 7 0 75 " 80 

Gly Ser His Asn Arg Asn Glu Leu Val Val lie Arg Arg Asp Arg Glu 
85 90 95 

Ser Gly Ala Ala Gly Gly Gly Ala Ala Arg Arg Ala Glu Ala Pro Cys 
100 105 no 

Gin lie Cys Gly Asp Glu Val Gly Val Gly Phe Asp Gly Glu Pro Phe 
115 120 125 

Val Ala Cys Asn Glu Cys Ala Phe Pro Val Cys Arg Ala Cys Tyr Glu 
130 135 140 



10 



WO 00/04166 



PCT/US99/15871 



Tyr Glu Arg Arg Glu Gly Ser Gin Ala Cys Pro Gin Cys Arg Thr Arq 
145 150 155 160 

Tyr Lys Arg Leu Lys Gly Cys Pro Arg Val Ala Gly Asp Glu Glu Glu 
165 170 175 

Asp Gly Val Asp Asp Leu Glu Gly Glu Phe Gly Leu Gin Asp Glv Ala 
180 185 190 

Ala His Glu Asp Asp Pro Gin Tyr Val Ala Glu Ser Met Leu Arg Ala 
195 200 205 

Gin Met Ser Tyr Gly Arg Gly Gly Asp Ala His Pro Gly Phe Ser Pro 
210 215 220 

Val Pro Asn Val Pro Leu Leu Thr Asa Gly Gin Met Val Asp Asp He 
225 230 235 2 40 

Pro Pro Glu Gin His Ala Leu Val Pro Ser Tyr Met Ser Gly Glv Glv 
245 250 255 

Gly Gly Gly Lys Arg He His Pro Leu Pro Phe Ala Asd Pro Asn Leu 
260 265 ' 270 

Pro Val Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr 
275 280 285 

Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Gly Trp Lys Gin 
290 295 300 

Lys Gin Glu Arg Leu Gin His Val Arg Ser Glu Gly Gly Glv Asd Trp 
305 310 315 320 

Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro 
325 330 335 

Leu Ser Arg Lys Val Pro lie Ser Ser Ser Arg He Asn Pro Tyr Arq 
340 345 350 

Met lie He Val He Arg Leu Val Val Leu Gly Phe Phe Phe His Ty^ 
355 360 3 6 5 

Arg Val Met His Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu He Ser 
370 375 3 8 o 

Val lie Cys Glu He Trp Phe Ala Met Ser Trp He Leu Asp Gin Phe 
385 390 395 400 

Pro Lys Trp Leu Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser 
405 410 415 

Leu Arg Phe Asp Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro He Asp 
4 20 425 430 

Phe Phe Val Ser Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr 
435 440 445 



Ala Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Glu Lys 
450 455 460 
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Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu 
465 470 475 48O 

Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys Trp Val Pro Phe Ser 
485 490 495 

Lys Lys Phe Asn lie Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin 
500 505 5io 

Lys He Asp Tyr Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu 
515 520 525 

Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn 
530 535 540 

Ala Leu Val Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met 
545 550 555 560 

Gin Asp Gly Ser Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly 
565 570 * 575 

Met He Gin Val Phe Leu Gly Gin Ser Gly Gly Arq Asp Val Glu Gly 
580 585 590 

Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arq Glu Lys Arg Pro Gly 
595 600 605 

Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arq Val 
610 615 620 

Ser Ala Val Leu Ser Asn Ala Ala Tyr Leu Leu Asn Leu Asp Cys Asp 
625 630 635 640 

His Tyr He Asn Asn Ser Lys Ala He Lys Glu Ala Met Cys Phe Met 
645 650 655 

Met Asp Pro Leu Val Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin 
660 665 670 

Arg Phe Asp Gly He Asp Lys Asn Asp Arg Tyr Ala Asn Arg Asn Val 
67 5 680 685 

Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro 
690 695 700 

He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly 
705 710 715 720 

Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn Cys 
725 730 735 

Trp Pro Lys Trp Cys Leu Ser Cys Cys Cys Ser Arg Asn Lys Asn Lys 
740 745 750 

Lys Lys Thr Thr Lys Pro Lys Thr Glu Lys Lys Lys Arg Leu Phe Phe 
? 55 760 765 

Lys Lys Ala Glu Asn Pro Ser Pro Ala Tyr Ala Leu Gly Glu He Asp 
770 775 780 
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Glu Gly Ala Pro Gly Ala Asp lie Glu Lys Ala Gly He Val Asn Gin 
785 790 795 8 00 

Gin Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Ala Ser 
805 810 815 

Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala Ser 
820 825 830 

Leu Leu Lys Glu Ala He His Val lie Ser Cys Gly Tyr Glu Asp Lys 
835 840 845 

Thr Asp Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser He Thr Glu 
850 855 860 

Asp He Leu Thr Gly Phe Lys Met His- Cys His Gly Trp Arg Ser He 
865 870 875 880 

Tyr Cys He Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro Leu Asn 
885 890 8 95 

Leu Ser Asp Arg Leu His Gin Val Leu Arg Tro Ala Leu Gly Ser Val 
900 905 910 

Glu He Phe Phe Ser Lys His Cys Pro Leu Trp Tyr Gly Tyr Gly Glv 
915 920 925 

Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr He Asn Ser He Val Tyr 
930 935 940 

Pro Trp" Thr Ser He Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala He 
945 950 955 960 

Cys Leu Leu Thr Gly Lys Phe lie Thr Pro Glu Leu Thr Asn Val Ala 
965 970 975 

Ser He Trp Phe Met Ala Leu Phe He Cys He Ser Val Thr Gly He 
980 985 990 

Leu Glu Met Arg Trp Ser Gly Val Ala He Asp Asp Tro Trp Arg Asn 
995 1000 1005 

Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Val 
1010 1015 1020 

Phe Gin Gly Leu Leu Lys Val Phe Ala Gly He Asp Thr Ser Phe Thr 
1025 1030 1035 1040 

Val Thr Ser Lys Ala Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr Thr 
1045 1050 1055 

Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu Leu 
1060 1065 1070 

Asn Phe He Gly Val Val Ala Gly He Ser Asn Ala He Asn Asn Gly 
1075 1080 1085 

Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp 
1Q 90 1095 " iioo 
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Val lie Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg G^n 
1105 1110 ins ' 1120 

Asn Arg Thr Pro Thr lie Val He Val Trp Ser He Leu Leu Ala Ser 
1125 H30 H35 

He Phe Ser Leu Leu Trp Val Arg Val Asp Pro Phe Leu Ala Lys Ser 
II 40 1145 H50 

Asn Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn 
1155 1160 H65 

<210> 9 
<211> 3936 
<212> DNA 
<213> Zea mays 

<400> 9 

cttctccctc gtcggtgcgg cgtggcgcgg ctcggcgttc ggtgagaaac cactcggggg 60 
atgaggatct gctgctagag tgagaggagc tacggtcagt atcctctgcc ttcgtcggcg 120 
gcggaagtgg aggggaggaa gcgatggagg cgagcgccgg gctggtggcc ggctcccaca 180 
accgcaacga gctcgtcgtc atccgccgcg acggcgatcc cgggccgaag ccgccgcggg 240 
agcagaacgg gcaggtgtgc cagatttgcg gcgacgacgt cggccttgcc cccggcgggg 300 
accccttcgt ggcgtgcaac gagtgcgcct tccccgtctg ccgggactgc tacgaatacg 360 
agcgccggga gggcacgcag aactgccccc agtgcaagac tcgatacaag cgcctcaagg 420 
gctgccaacg tgtgaccggt gacgaggagg aggacggcgt cgatgacctg gacaacgagt 480 
tcaactggga cggccatgac tcgcagtctg tggccgagtc catgctctac ggccacatga 540 
gctacggccg tggaggtgac cctaatggcg cgccacaagc tttccagctc aaccccaatg 600 
ttccactcct caccaacggg caaatggtgg atgacatccc accggagcag cacgcgctgg 660 
tgccttcttt catgggtggt gggggaaaga ggatacatcc ccttccttat gcggatccca 720 
gcttacctgt gcaacccagg tctatggacc catccaagga tcttgctgca tatgggtatg 780 
gtagtgttgc ttggaaggaa cggatggaga attggaagca gagacaagag aggatgcacc 840 
agacggggaa tgatggtggt ggtgatgatg gtgacgatgc tgatctacca ctaatggatg 900 
aagcaagaca acaactgtcc aggaaaattc cacttccatc aagccagatt aatccatata 960 
ggatgattat cattattcgg cttgtggttt tggggttctt cttccactac cgagtgatgc 1020 
atccggtgaa tgatgcattt gctttgtggc tcatatctgt tatctgtgaa atctggtttg 1080 
ccatgtcttg gattcttgat caattcccaa agtggttccc tattgagaga gagacttacc 1140 
tagaccggct gtcactgagg ttcgacaagg aaggccagcc atctcaactt gctccaattg 1200 
atttctttgt cagtacggtt gatcccttaa aggaacctcc tttggtcaca acaaatactg 1260 
ttctatctat cctttcggtg gattatcctg ttgataaggt ttcttgctat gtttctgatg 1320 
atggtgctgc aatgctaacg tttgaagcat tatctgaaac atctgaattt gcaaagaaat 1380 
gggttccttt ctgcaaacgg tacaatattg aacctcgcgc tccagagtgg tacttccaac 1440 
agaagataga ctacttgaaa gacaaggtgg cagcaaactt tgttagggag aggagagcaa 1500 
tgaagagaga gtatgaggaa ttcaaggtga gaatcaatgc cttagttgcc aaagcccaga 1560 
aagttcctga agaaggatgg acaatgcaag atggaacccc ctggcctgga aacaatgttc 1620 
gtgatcatcc tggaatgatt caggtcttcc ttggccaaag cggaggcctt gactgtgagg 1680 
gaaatgaact gccacgattg gtttatgttt ctagagagaa acgaccaggc tataaccatc 1740 
ataagaaagc tggtgctatg aatgcattgg tccgagtctc tgctgtacta acaaatgctc 1800 
catatttgtt aaacttggat tgtgatcact acatcaacaa cagcaaggct ataaaggaag 18 60 
caatgtgttt tatgatggac cctttactag gaaagaaggt ttgctatgta cagttccctc 1920 
aaagatttga tgggattgat cgccatgacc gatatgctaa ccggaatgtt gtcttttttg 1980 
atatcaacat gaaaggtttg gatggtattc agggtccaat ttatgttggt actggatgtg 2040 
tatttagaag gcaggcatta tatggttatg atgcccccaa aacaaagaag ccaccatcaa 2100 
ggacttgcaa ctgctggccc aagtggtgct tttgctgttg ctgctttggc aataggaagc 2160 
aaaagaagac taccaaaccc aaaacagaga agaaaaagtt attatttttc aagaaagaag 2220 
agaaccaatc ccctgcatat gctcttggtg aaattgacga agctgctcca ggagctgaga 2280 
atgaaaaggc cggtattgta aatcaacaaa aattagaaaa gaaatttggc caatcttctg 2340 
tttttgttac atccacactt ctcgagaatg gtggaacctt gaagagtgca agtcctgctt 2400 
ctcttttgaa agaagctata catgtcatta gttgtggtta tgaagacaag acagactggg 24 60 
gaaaagagat tggctggatc tatggatcag ttacagaaga tattctaact ggtttcaaga 2520 
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tgcattgtca tggttggcgg tcaatttact gcatacctaa acgggttgca ttcaaaggtt 2580 
ctgcacctct gaatctttca gatcgtcttc accaggtgct tcggtgggct cttgggtcta 2640 
ttgagatctt cttcagcaat cattgccctc tttggtatgg gtatggtggc ggtctgaaat 2700 
ttttggaaag attttcctac atcaactcca tcgtgtatcc ttggacatct attcccctct 2760 
tggcttactg tacattgcct gccatctgtt tattgacagg gaaatttatc actccagagc 2820 
tgaataatgt tgccagcctg tggttcatgt cactttttat ctgcattttt gctacgagca 2880 
tcctagaaat gagatggagt ggtgttggaa ttgatgactg gtggaggaat gagcagttct 2940 
gggtcattgg aggtgtgtcc tcacacctct ttgctgtgtt ccagggactt ctcaaggtca 3000 
tagctggtgc tgatacaagc ttcaccgtga catcaaaggg tggagatgat gaggagttct 3060 
cagagctata tacattcaaa tggactacct tattgatacc tcctaccacc ttgcttctat 3120 
tgaacttcat tggtgtggtc gctggcgttt caaatgcgat caataacgga tatgagtcat 3180 
ggggccccct ctttgggaag ctattctttg cattttgggt gattgtccat ctttatccct 3240 
ttctcaaagg tttggttgga aggcaaaaca ggacaccaac gattgtcatc gtctggtcca 3300 
ttctgctggc ttcaatcttc tcgctccttt gggttcggat tgatcctttc cttgcgaagg 3360 
atgatggtcc gcttcttgag gagtgtggtt tggattgcaa ctaggatgtc agtgcatcag 3420 
ctcccccaat ctgcatatgc ttgaagtata ttttctggtg tttgtcccca tattcagtgt 3480 
ctgtagataa gagacatgaa atgtcccaag tttcttttga tccatggtga acctacttaa 3540 
tatctgagag atatactggg ggaaaatgga ggctgcggca atccttgtgc agttgggccg 3600 
tggaatacag catatgcaag tgtttgattg tgcagcattc tttattactt ggtcgcaata 3660 
tagatgggct gagccgaaca gcaaggtatt ttgattctgc actgctcccg tgtacaaact 3720 
tggttctcaa taaggcaggc aggaatgcat ctgccagtgg aacagagcaa cctgcacatt 3780 
atttatgtat gcctgttcat tggagggctt gttcattaca tgttcgtcta tactagaaaa 3840 
aacagaatat tagcattaat ctatagttaa ttaaagtatg taaatgcgcc tgttttttgt 3900 
tgtgtactgt aatcatctga gttggttttg tgaaaa 3936 

<210> 10 

<211> 1086 

<212> PRT 

<213> Zea mays 

<400> 10 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
15 10 15 

Leu Val Val He Arg Arg Asp Gly Asp Pro Gly Pro Lys Pro Pro Ara 
20 25 30 

Glu Gin Asn Gly Gin Val Cys Gin He Cys Gly Asp Asp Val Gly Leu 



35 40 



45 



Ala Pro Gly Gly Asp Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Asp Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gin Asn 
65 70 75 80 

Cys Pro Gin Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Cys Gin Arg 
85 90 ' 95 

Val Thr Gly Asp Glu Glu Glu Asp Gly Val Asp Asp Leu Asp Asn Glu 
100 105 no 

Phe Asn Trp Asp Gly His Asp Ser Gin Ser Val Ala Glu Ser Met Leu 
H5 120 125 

Tyr Gly His Met Ser Tyr Gly Arg Gly Gly Asp Pro Asn Gly Ala Pro 
I 30 135 140 

Gin Ala Phe Gin Leu Asn Pro Asn Val Pro Leu Leu Thr Asn Gly Gin 
145 150 155 160 
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Met Val Asp Asp He Pro Pro Glu Gin His Ala Leu Val Pro Ser Phe 
165 170 " 175 

Met Gly Gly Gly Gly Lys Arg He His Pro Leu Pro Tyr Ala Asp Pro 
180 185 X90 

Ser Leu Pro Val Gin Pro Arg Ser Met Asp Pro Ser Lys Asp Leu Ala 
195 200 205 

Ala Tyr Gly Tyr Gly Ser Val Ala Trp Lys Glu Arg Met Glu Asn Trp 
210 215 .220 

Lys Gin Arg Gin Glu Arg Met His Gin Thr Gly Asn Asp Gly Glv Glv 
225 230 235 240 

Asp Asp Gly Asp Asp Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin 
245 250 * 255 

Gin Leu Ser Arg Lys He Pro Leu Pro Ser Ser Gin He Asn Pro Tyr 
260 265 270 

Arg Met He He He He Arg Leu Val Val Leu Gly Phe Phe Phe His 
275 280 285 

Tyr Arg Val Met His Pro Val Asn Asp Ala Phe Ala Leu Trp Leu He 
290 295 300 

Ser Val He Cys Glu He Trp Phe Ala Met Ser Trp He Leu Asp Gin 
305 310 315 320 

Phe Pro Lys Trp Phe Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu 
325 330 335 

Ser Leu Arg Phe Asp Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro He 
340 345 350 

Asp Phe Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val 
355 360 365 

Thr Thr Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro Val Asp 
370 375 380 

Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe 
385 390 395 400 

Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Lys Lys TrD Val Pro Phe 
405 410 " 415 

Cys Lys Arg Tyr Asn He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin 
420 425 430 

Gin Lys He Asp Tyr Leu Lys Asp Lys Val Ala Ala Asn Phe Val Arg 
435 440 445 

Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He 
450 4 55 4 60 

Asn Ala Leu Val Ala Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr 
465 470 475 480 
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Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro 
485 490 495 

Gly Met lie Gin Val Phe Leu Gly Gin Ser Gly Gly Leu Asp Cys Glu 
500 505 510 

Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro 
515 520 525 

Gly Tyr Asn His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Ara 
530 535 540 

Val Ser Ala Val Leu Thr Asn Ala Pro Tyr Leu Leu Asn Leu Asp Cvs 
545 550 555 560 

Asp His Tyr lie Asn Asn Ser Lys Ala He Lys Glu Ala Met Cys Phe 
565 570 575 

Met Met Asp Pro Leu Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro 
580 585 590 

Gin Arg Phe Asp Gly He Asp Arg His Asp Arg Tyr Ala Asn Arg Asn 
5 95 600 6 05 

Val Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly 
610 615 620 

Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr 
625 630 635 640 

Gly Tyr Asp Ala Pro Lys Thr Lys Lys Pro Pro Ser Arg Thr Cys Asn 
645 650 655 

Cys Trp Pro Lys Trp Cys Phe Cys Cys Cys Cys Phe Gly Asn Arg Lys 
660 665 " 670 

Gin Lys Lys Thr Thr Lys Pro Lys Thr Glu Lvs Lys Lys Leu Leu Phe 
675 680 685 

Phe Lys Lys Glu Glu Asn Gin Ser Pro Ala Tyr Ala Leu Gly Glu He 
690 695 700 

Asp Glu Ala Ala Pro Gly Ala Glu Asn Glu Lys Ala Gly He Val Asn 
705 710 715 720 

Gin Gin Lys Leu Glu Lys Lys Phe Gly Gin Ser Ser Val Phe Val Thr 
725 730 735 

Ser Thr Leu Leu Glu Asn Gly Gly Thr Leu Lys Ser Ala Ser Pro Ala 
7 40 745 750 

Ser Leu Leu Lys Glu Ala He His Val lie Ser Cys Gly Tyr Glu Asp 
755 760 765 

Lys Thr Asp Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr 
770 775 780 

Glu Asp He Leu Thr Gly Phe Lys Met His Cys His Gly Trp Arq Ser 
785 790 795 800 
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He Tyr Cys He Pro Lys Arg Val Ala Phe Lys Gly Ser Ala Pro Leu 
805 810 " " 815 

Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser 
820 825 * 830 

He Glu He Phe Phe Ser Asn His Cys Pro Leu Trp Tyr Gly Tyr Gly 
835 840 845 

Gly Gly Leu Lys Phe Leu Glu Arg Phe Ser Tyr He Asn Ser He Val 
850 855 860 

Tyr Pro Trp Thr Ser He Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala 
865 870 875 880 

He Cys Leu Leu Thr Gly Lys Phe He Thr Pro Glu Leu Asn Asn Val 
885 890 895 

Ala Ser Leu Trp Phe Met Ser Leu Phe He Cys He Phe Ala Thr Ser 
900 905 910 

He Leu Glu Met Arg Trp Ser Gly Val Gly He Asp Aso Trp Trp Arg 
915 920 925 

Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ser His Leu Phe Ala 
930 935 940 

Val Phe Gin Gly Leu Leu Lys Val He Ala Gly Val Asp Thr Ser Phe 
945 950 955 960 

Thr Val Thr Ser Lys Gly Gly Asp Asp Glu Glu Phe Ser Glu Leu Tyr 
965 970 975 

Thr Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu Leu 
980 985 990 

Leu Asn Phe He Gly Val Val Ala Gly Val Ser Asn Ala He Asn Asn 
995 1000 1005 

Gly Tyr Glu Ser Trp Gly Pro Leu Phe Gly Lvs Leu Phe Phe Ala Phe 
1010 1015 1020 

Trp Val He Val His Leu Tyr Pro Phe Leu Lys Gly Leu Val Gly Arg 
1025 1030 1035 1040 

Gin Asn Arg Thr Pro Thr He Val He Val Trp Ser lie Leu Leu Ala 
1045 1050 1055 

Ser He Phe Ser Leu Leu Trp Val Arg lie Asp Pro Phe Leu Ala Lys 
1060 1065 1070 

Asp Asp Gly Pro Leu Leu Glu Glu Cys Gly Leu Asp Cys Asn 
1075 1080 1085 

<210> 11 

<211> 1138 

<212> DNA 

<213> Oryza sativa 
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<400> 11 

cgctgctccc ggcgatggac gtgttcgtca 
tcgccacggc gaacaccgtg ctgtccatat 
gtgacgtgct atatttccga cgacgcaggc 
gcggcccggt tcgcggcgct ttgggtgtcg 
aacctggagg cgtacttcaa cgccggcgag 
agggggagct acagggggat ggcgtggccg 
cgcgagtacg aggagatgcg gctgcggatc 
cggcgccgcg gcgcggccga tgaccacgcc 
gggagcgtgc cacagctcgg cgttgcgaac 
gacgtgtgcc tcccggcgct tgtgtacgtg 
caccggaagg cgggcgccat gaacgcgccc 
gtcaacaact cgcaggccct ccgcgccggc 
ggagccgccg aagacgccgg cgcggtcgcg 
gtcgatcccg gcgaccgcta cgccaaccac 
ggcctcgacg gcctccaggg ccccatctac 
gcgctctaca gcgtcgacct gccgcgctgg 
ctcggagaag acgagcggct atggtccagg 
aggtgaaaaa ctcagctaaa acctgaccca 
aaaatgaaat ttactttttt ttttttacca 

<210> 12 

<211> 341 

<212> PRT 

<213> Oryza sativa 



ccaccgccga ccccgacaag gagccgccgc 60 
atcctcgccg cgggctaccc cgccggcaag 120 
gcggaggtga cacgtaacgc ggtcgtggag 180 
ttctgccgga agcacggcgt cgagccgagg 240 
ggtggtggtg gcaaggcgaa ggtggtggcg 300 
gagctggtgc gcgacaggag acgggtgcgc 360 
gacgcgctgc aggccgccga tgcgcgccgc 420 
ggagttgtgc aggtactgat cgattttgct 480 
gggagcaagc tcatcgacgt cgcctctgtc 540 
tgccgcgaga agcgccgcgg ccacgcgcac 600 
ttcatcctcg acctcgactg cgactactac 660 
atctgcttca tgatcgaacg cggcggcggc 720 
ttcgtccagt tcccgcagcg ggtcgacggc 780 
aaccgcgtcc tcttcgactg caccgagctc 840 
gtcggcaccg gctgcttgtt ccgccgtgtc 900 
agaccgcggc gttcattggg ctgtcgccta 960 
atgaaacaaa tggtaatatt aagtggtcca 1020 
agctgtaaca tgggtaaaaa tatatggccc 1080 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1138 



<400> 12 

Arg Cys Ser Arg Arg Trp Thr Cys Ser Ser Pro Pro Pro Thr Pro Thr 
15 10 15 

Arg Ser Arg Arg Ser Pro Arg Arg Thr Pro Cys Cys Pro Tyr lie Leu 
20 25 30 

Ala Ala Gly Tyr Pro Ala Gly Lys Val Thr Cys Tyr He Ser Asp Asp 
35 40 ' 45 

Ala Gly Ala Glu Val Thr Arg Asn Ala Val Val Glu Ala Ala Arq Phe 
50 55 60 

Ala Ala Leu Trp Val Ser Phe Cys Arg Lys His Gly Val Glu Pro Ara 
65 70 75 80 

Asn Leu Glu Ala Tyr Phe Asn Ala Gly Glu Gly Gly Gly Gly Lys Ala 
85 90 95 

Lys Val Val Ala Arg Gly Ser Tyr Arg Gly Met Ala Trp Pro Glu Leu 
100 105 no 

Val Arg Asp Arg Arg Arg Val Arg Arg Glu Tyr Glu Glu Met Arg Leu 
115 120 125 

Arg He Asp Ala Leu Gin Ala Ala Asp Ala Arg Arg Arg Arg Arg Gly 
130 135 140 



Ala Ala Asp Asp His Ala Gly Val Val Gin Val Leu He Asp Phe Ala 
145 150 155 160 

Gly Ser Val Pro Gin Leu Gly Val Ala Asn Gly Ser Lys Leu lie Asp 

165 no ' 175 
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Val Ala Ser Val Asp Val Cys Leu Pro Ala Leu Val Tyr Val Cys Arg 
180 185 190 

Glu Lys Arg Arg Gly His Ala His His Arg Lys Ala Gly Ala Met Asn 
195 200 205 

Ala Pro Phe He Leu Asp Leu Asp Cys Asp Tyr Tyr Val Asn Asn Ser 
210 215 220 

Gin Ala Leu Arg Ala Gly He Cys Phe Met He Glu Arg Gly Glv Glv 
225 230 2 35 * * 24Q 

Gly Ala Ala Glu Asp Ala Gly Ala Val Ala Phe Val Gin Phe Pro Gin 
245 250 255 

Arg Val Asp Gly Val Asp Pro Gly Asp Arg Tyr Ala Asn His Asn Arg 
260 265 270 

Val Leu Phe Asp Cys Thr Glu Leu Gly Leu Asp Gly Leu Gin Gly Pro 
275 280 285 

lie Tyr Val Gly Thr Gly Cys Leu Phe Arg Arg Val Ala Leu Tyr Ser 
290 295 * 300 

Val Asp Leu Pro Arg Trp Arg Pro Arg Arg Ser Leu Gly Cys Arq Leu 
305 310 315 320 

Leu Gly Glu Asp Glu Arg Leu Trp Ser Arg Met Lys Gin Met Val He 
325 330 335 



Leu Ser Gly Pro Arg 
340 



<210> 13 

<211> 3517 

<212> DNA 

<213> Glycine max 



<400> 13 

gcacgagcca acaacaacac ccttatgtgg 
caatcttcct tcataaaaca caaacctttg 
tgttgtgcca ttcccatatt gtcccattca 
cgctgggtca cataaccgca atgagctagt 
tttgaagaac ttggatgggc aagtgtgtga 
ggatggagac ttgtttgtgg cttgcaatga 
tgagtatgaa aggagagaag gaagccacct 
tctcaaaggg agcccccgag tggagggaga 
gcatgaattc aatattgatg agcaaaagaa 
tcatgggagg atgagctatg gaagaggtcc 
acctgtcatt gctggtggtc gttctaggcc 
tgcttatggg gatcagatgt tatcctcttc 
gtctgaacct ggaagtgcaa gatgggacga 
tggatgactg gaaattgcag caaggcaatt 
cagccatgtt agatgaagca aggcaaccac 
aaatcaatcc atatagaatg gtgattgtgg 
gatacagact catgaaccca gtacatgatg 
gtgaaatctg gtttgctttt tcatggattc 
atagagagac ctaccttgac cgtctttcca 
tgcttgctcc tgtagatgtt tttgttagta 
ttacagcaaa cactgttctt tcaatcttgg 



acacattagg tgaggttcaa cagctagcac 60 
atcacacaat ctcaccttaa tttgtgttgt 120 
ctaagacatg gaagccagcg ctggactggt 180 
tgtcattcat ggccatgaag agccgaaggc 240 
gatttgtggt gatggcgtgg gactcacggt 300 
gtgtggtttt ccagtgtgca ggccttgcta 360 
ttgcccacag tgcaaaacca gatacaagcg 420 
tgatgatgaa gaggatgtgg atgatattga 480 
caagcatggc caggttgcag aagccatgct 54 0 
tgaagatgat gacaattccc agttcccaac 600 
tgtaagtggg gagttcccaa tatcatctaa 660 
actgcataaa agagtgcatc catatccagt 720 
aaaaaaaaga agatggatgg aaagatagaa 780 
tggggcctga accggatgaa gatccagatg 84 0 
tgtcaaggaa agtgccaata gcatccagca 900 
cacgtctggt tattcttgct ttcttcctca 960 
ccctggggct atggctaacc tctatcatat 1020 
tggatcagtt tcccaaatgg tttcccattg 1080 
tcaggtatga gcgtgaaggt gaaccaaaca 1140 
ccgtggatcc catgaaggaa cctcctctgg 1200 
ccatggatta cccggttgat aaaatatcat 1260 
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gctacatttc tgatgatgga gcctcaatgt gtacatttga gtccttatca gaaactgcag 1320 
agtttgctag aaagtgggta ccgttttgta agaaattttc catagaacct cgggcacctq 1380 
agatgtactt cagcgagaag attgactacc taaaggacaa agtgcaaccc acctttgtta 1440 
aggagcgtcg agctatgaag agggaatacg aagagtttaa ggttaggatc aatgcacttg 1500 
ttgctaaggc ccagaaagtt cctcagggag gatggatcat gcaggatggg acaccatggc 1560 
cagggaataa cactaaggat catcctggta tgattcaagt gtttcttggt agcagtggag 1620 
gtcttgatac tgaaggaaac caacttcctc gccttgttta rgtttccaga gagaaaaggc 1680 
ctggttttca acaccacaag aaagctggtg ccatgaatgc tctggttcgg gtatctgctg 1740 
ttctcacaaa tgctcctttc atgttgaact tggattgtga tcactatgtc aataacagca 1800 
aggctgcccg agaggccatg tgcttcttga tggacccaca aactgggaag aaggtctgct 1860 
atgtccagtt tcctcaaaga tttgatggta ttgatacaca tgatcgttat gccaacagga 1920 
acacagtttt ctttgatatt aacatgaagg gtctagatgg tattcaaggt cctgtatatg 1980 
tggggactgg atgtgttttc aggaggcaag ctttgtatgg ctataatcct cccaagggtc 2040 
caaagcgtcc aaaaatggta agctgtgatt gttgcccgtg ttttggaagc cgcaagaagt 2100 
ataaggagaa gaatgatgca aatggagagg ctgcaagcct aaaagggatg gatgatgaca 2160 
aagaggtgtt gatgtcccaa atgaattttg agaagaaatt tggacaatcc tctatttttg 2220 
tgacttctac cttgatggaa gagggtggtg tgcctccttc ttcaagtcca gctgccctgc 2280 
ttaaagaagc cattcatgtg attagctgtg gatatgaaga taaaactgaa tggggacttg 2340 
agcttggttg gatctatgga tctatcacag aagatattct aacaggtttt aagatgcatt 2400 
gccgtgggtg gaggtccatt tattgtatgc caaagagagc tgcattcaag ggtactgctc 2460 
ctatcaactt gtcagatcgt ctcaaccagg ttcttcgttg ggcacttggt tccattgaga 2520 
ttttctttag tcaccattgc cctctatggt atggcttcaa ggaaaagaag ctaaagtggc 2580 
ttgagagatt tgcctatgca aacacaactg tctatccatt cacctccatt cctctagttg 2640 
cctactgtat tcttccagca gtttgtttac tcactgacaa attcatcatg ccaccgatta 2700 
gcacctttgc tggtttgtac tttgttgctc tcttctcctc aatcattgca actggtattc 2760 
ttgagttgaa atggagtgga gtgagcattg aggaatggtg gagaaatgag cagttttggg 2820 
tcattggtgg tgtatcagct cacctctttg ctgttataca aggtctgcta aaggttctgg 2880 
ctggaattga caccaatttc actgttacat caaaggcaac agatgatgaa gagtttggag 2940 
aattgtacac ctttaagtgg actacactct tgattcctcc aaccactatt ttgatcatta 3000 
acattgttgg tgttgttgct ggaatctcag atgccataaa caatgggtac caatcctggg 3060 
gaccactctt tggaaagctc ttcttttcct tctgggtgat tgtccatctc tatccattcc 3120 
ttaaaggttt gatgggtcgc caaaatcgca cacccaccat tgttgtgatt tggtcagtgc 3180 
tattggcctc tattttctcc ttactttggg taagaattga tccatttgtc ctcaagacta 3240 
agggacctga taccaagcta tgtggaatca actgctaaaa aagactgctt tccctatagt 3300 
attattcttt aaaagatgta tgtagggtac atacattctt ggtttcacaa accaacaaag 3360 
tggcaatgca caaggatcaa taaggaaaga gtgaaaattt tgtgtatcat aaatgagtgt 3420 
tatcattttt gtaaatgttc tcaaggacat ctgttttggt rggaactgcc caaaaattgc 3480 
agttttatct attcactgga aaaaaaaaaa aaaaaaa 3517 



<210> 


14 


<211> 


1039 


<212> 


PRT 


<213> 


Glycine 


<220> 




<221> 


UNSURE 


<222> 


(201) 


<400> 


14 



Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
15 10 15 

Leu Val Val lie His Gly His Glu Glu Pro Lys Ala Leu Lys Asn Leu 
20 25 30 

Asp Gly Gin Val Cys Glu He Cys Gly Asp Gly Val Gly Leu Thr Val 
35 40 45 

Asp Gly Asp Leu Phe Val Ala Cys Asn Glu Cys Gly Phe Pro Val Cvs 
50 55 60 
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Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Ser His Leu Cys Pro 
65 70 75 80 

Gin Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Arg Val Glu 
85 go 95 

Gly Asp Asp Asp Glu Glu Asp Val Asp Asp lie Glu His Glu Phe Asn 
100 105 no 

He Asp Glu Gin Lys Asn Lys His Gly Gin Val Ala Glu Ala Met Leu 
115 120 125 

His Gly Arg Met Ser Tyr Gly Arg Gly Pro Glu Asp Asp Asp Asn Ser 
130 135 140 

Gin Phe Pro Thr Pro Val lie Ala Gly Gly Arg Ser Arg Pro Val Ser 
145 150 155 leo 

Gly Glu Phe Pro lie Ser Ser Asn Ala Tyr Gly Asp Gin Met Leu Ser 
165 no i7 5 

Ser Ser Leu His Lys Arg Val His Pro Tyr Pro Val Ser Glu Pro Gly 
180 185 190 

Ser Ala Arg Trp Asp Glu Lys Lys Xaa Asp Gly Trp Lys Asp Arq Met 
195 200 205 

Asp Asp Trp Lys Leu Gin Gin Gly Asn Leu Gly Pro Glu Pro Asp Glu 
210 215 220 

Asp Pro Asp Ala Ala Met Leu Asp Glu Ala Arg Gin Pro Leu Ser Arq 
225 230 235 240 

Lys Val Pro He Ala Ser Ser Lys He Asn Pro Tyr Arg Met Val He 
245 250 " ' 255 

Val Ala Arg Leu Val He Leu Ala Phe Phe Leu Arg Tyr Arg Leu Met 
260 265 270 

Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser He He Cvs 
275 280 285 

Glu He Trp Phe Ala Phe Ser Trp He Leu Asp Gin Phe Pro Lys Trp 
2 90 295 300 

Phe Pro He Asp Arg Glu Thr Tyr Leu Asp Arg Leu Ser lie Arg Tyr 
305 310 315 320 

Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe Val 
325 330 335 

Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu Val Thr Ala Asn Thr 
340 345 350 

Val Leu Ser lie Leu Ala Met Asp Tyr Pro Val Asp Lys He Ser Cys 
355 360 365 

Tyr He Ser Asp Asp Gly Ala Ser Met Cys Thr Phe Glu Ser Leu Ser 
370 375 380 
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Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys Phe 
385 390 395 ~ 400 

Ser He Glu Pro Arg Ala Pro Glu Met Tyr Phe Ser Glu Lys He Asp 
405 410 415 

Tyr Leu Lys Asp Lys Val Gin Pro Thr Phe Val Lys Glu Arg Arg Ala 
420 425 430 

Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val 
435 440 445 

Ala Lys Ala Gin Lys Val Pro Gin Gly Gly Trp He Met Gin Asp Glv 
450 455 460 

Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He Gin 
465 470 475 J 48Q 

Val Phe Leu Gly Ser Ser Gly Gly Leu Asp Thr Glu Gly Asn Gin Leu 
485 490 495 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin His 
500 505 510 

His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Ala Val 
515 520 525 

Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr Val 
530 535 540 

Asn Asn Ser Lys Ala Ala Arg Glu Ala Met Cys Phe Leu Met Asp Pro 
545 550 555 560 

Gin Thr Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
565 570 575 

Gly He Asp Thr His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe 
580 585 590 

Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro Val Tyr Val 
595 600 6 05 

Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Asn Pro 
610 615 620 

Pro Lys Gly Pro Lys Arg Pro Lys Met Val Ser Cys Asp Cys Cys Pro 
625 630 635 640 

Cys Phe Gly Ser Arg Lys Lys Tyr Lys Glu Lys Asn Asp Ala Asn Gly 
645 650 655 

Glu Ala Ala Ser Leu Lys Gly Met Asp Asp Asp Lys Glu Val Leu Met 
660 665 ' 670 

Ser Gin Met Asn Phe Glu Lys Lys Phe Gly Gin Ser Ser He Phe Val 
675 680 685 

Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser Ser Ser Pro 
690 695 700 
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Ala Ala Leu Leu Lys Glu Ala He His Val He Ser Cvs Gly Tyr Glu 
705 710 715 ' 720 

Asp Lys Thr Glu Trp Gly Leu Glu Leu Gly Trp He Tyr Gly Ser lie 
725 730 735 

Thr Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Arg 
740 745 750 

Ser He Tyr Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Thr Ala Pro 
755 760 765 

He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Glv 
770 775 780 

Ser lie Glu He Phe Phe Ser His His Cys Pro Leu Trp Tyr Gly Phe 
785 790 795 800 

Lys Glu Lys Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr Ala Asn Thr 
805 810 J 815 

Thr Val Tyr Pro Phe Thr Ser He Pro Leu Val Ala Tyr Cys He Leu 
820 825 830 

Pro Ala Val Cys Leu Leu Thr Asp Lys Phe He Met Pro Pro He Ser 
835 840 845 

Thr Phe Ala Gly Leu Tyr Phe Val Ala Leu Phe Ser Ser He He Ala 
850 855 860 

Thr Gly He Leu Glu Leu Lys Trp Ser Gly Val Ser He Glu Glu Trp 
865 870 8 75 88 0 

Trp Arg Asn Glu Gin Phe Trp Val He Gly Glv Val Ser Ala His Leu 
885 890 895 

Phe Ala Val He Gin Gly Leu Leu Lys Val Leu Ala Gly lie Asp Thr 
900 905 910 

Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu 
915 920 925 

Leu Tyr Thr Phe Lys Trp Thr Thr Leu Leu lie Pro Pro Thr Thr lie 
930 935 940 

Leu lie lie Asn lie Val Gly Val Val Ala Gly lie Ser Asp Ala lie 
94 5 950 955 9 6 o 

Asn Asn Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe 
965 970 975 

Ser Phe Trp Val lie Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met 
980 985 990 

Gly Arg Gin Asn Arg Thr Pro Thr lie Val Val lie Trp Ser Val Leu 
995 1000 1005 

Leu Ala Ser lie Phe Ser Leu Leu Trp Val Arg lie Asp Pro Phe Val 
1010 1015 1020 
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Leu Lys Thr Lys Gly Pro Asp Thr Lys Leu Cys Gly He Asn Cys 
1025 1030 1035 

<210> 15 

<211> 2125 

<212> DNA 

<213> Glycine max 

<400> 15 

gccaaagctc agaagatgcc agaggaaggt tggacaatgc aggatggaac tccttggcct 60 
ggaaataatc ctagggatca tccgggaatg attcaggtgt ttttaggtca tagtgggggg 120 
ctggatacag atggaaatga gctgcctaga cttgtttatg tttctcgtga gaagcgacca 180 
ggcttccaac atcacaagaa ggctggagct atgaatgctt tgattcgagt ttctgctgtc 240 
ttgaccaatg gtgcatatct tctgaatgtg gattgtgatc actatttcaa taatagcaaa 300 
gccctcaaag aagccatgtg tttcatgatg gatcctgttc ttggaaagaa gacatgctat 360 
gttcaatttc ctcagagatt tgacggcatt gacttgcacg atcgatatgc caatcgcaat 420 
attgtgttct ttgatatcaa catgaaaggt caggatggtg ttcagggccc agtctatgtg 4 80 
ggaactggtt gttgtttcaa taggcaagct ttgtatggtt atgatcctgt tttgactgag 540 
gaagatttgg aacctaacat tattgtaaag agttgttgcg gttctagaaa gaagggaaag 600 
ggtggcaata agaagtacag tgacaagaag aaggcgatgg gaagaactga atccactgta 660 
cccatattta atatggaaga catagaggag ggtgttgaag gttatgatga tgaaaggaca 720 
ctacttatgt ctcaaaagag cttggagaag cgttttggtc agtctccagt ttttattgct 780 
gccactttca tggagcaggg tggcattcca ccttcaacga accctgcaac tcttcttaag 84 0 
gaagcaatcc atgttatcag ctgtggttac gaagacaaga cagaatgggg caaagagatt 900 
ggatggatct atggctctgt gacagaagat atcttgactg ggttcaagat gcatgctcgt 960 
ggttggattt ccatctattg catgccacct cgcccagcat ttaagggttc tgctcctatc 1020 
aatctttctg atcgtctcaa tcaggtgctt cggtgggcct tgggttcaat tgagatcttt 1080 
ctaagcaggc attgtccctt gtggtatggc tacaatggga agttgaagcc tctgatgagg 1140 
cttgcttata ttaacaccat tgtctacccg tttacctcaa tcccattgat tgcttactgt 1200 
acgcttcctg cattttgtct tctcacaaat aaatttatta ttcctgagat aagcaacttt 1260 
gccagtatgt ggttcattct tctctttgtc tccattttta ccacttcaat tcttgagctt 1320 
aggtggagtg gggtcagtat agaagactgg tggagaaatg aacagttctg ggttatcggt 1380 
gggacatctg cgcatctctt tgctgtgttc caggggcttc taaaagtgct tgctgggatc 1440 
gatacaaatt ttactgttac atcgaaggca tcggacgagg atggggactt tgccgagctt 1500 
tatgtgttta aatggacatc acttctcatc cctcctacaa cagtgcttat tgtgaatttg 1560 
gttgggattg tggctggtgt atcctatgcc ataaacagtg gttaccagtc ttggggtcca 1620 
ctatttggca agctgttctt tgctatctgg gtcattgccc atctataccc attcttgaag 1680 
ggtctcttgg gcaggcaaaa tcgtacccca accattgtta ttgtttggtc cgttcttctt 1740 
gcttcaatat tctccttgct gtgggtgagg attgatccct tcacctctga ctccaacaaa 1800 
ttaaccaatg gtcaatgtgg catcaactgt tagttctctt gtatgattca ttttgtgttg 1860 
ttattccctt ttgcttggag atacacaagg ttgctgtcgt gtatatagca agaattttca 1920 
gcctatcaaa gttgtctgga ggattgaacc cctgaaatag atgggaatgt accctctctg 1980 
tttctattat ttatctacat gttccttaca agaatagtca gtagtaatgt tgaggtgtat 2040 
gttatatttt ttccccacag aatataaatt tgttcatgcg aatatttaat gaaagccaac 2100 
aaggtcctgt gttgttttgt tcttt 212 5 

<210> 16 

<211> 610 

<212> PRT 

<213> Glycine max 

<400> 16 

Ala Lys Ala Gin Lys Met Pro Glu Glu Gly Trp Thr Met Gin Asp Gly 
1 5 10 15 

Thr Pro Trp Pro Gly Asn Asn Pro Arg Asp His Pro Gly Met He Gin 
20 25 30 



25 



WO 00/04166 



PCT/US99/15871 



Val Phe Leu Gly His Ser Gly Gly Leu Asp Thr Asp Gly Asn Glu Leu 
35 40 45 

Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin His 
50 55 60 

His Lys Lys Ala Gly Ala Met Asn Ala Leu He Arg Val Ser Ala Val 
65 70 75 80 

Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val Asp Cys Asp His Tyr Phe 
85 90 * 95 

Asn Asn Ser Lys Ala Leu Lys Glu Ala Met Cys Phe Met Met Asp Pro 
100 105 no 

Val Leu Gly Lys Lys Thr Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 
115 120 125 

Gly He Asp Leu His Asp Arg Tyr Ala Asn Arg Asn He Val Phe Phe 
130 135 140 

Asp He Asn Met Lys Gly Gin Asp Gly Val Gin Gly Pro Val Tyr Val 
145 150 155 160 

Gly Thr Gly Cys Cys Phe Asn Arg Gin Ala Leu Tyr Gly Tyr Asp Pro 
165 170 175 

Val Leu Thr Glu Glu Asp Leu Glu Pro Asn He He Val Lys Ser Cys 
180 185 190 

Cys Gly Ser Arg Lys Lys Gly Lys Gly Gly Asn Lys Lys Tyr Ser Asp 
195 200 205 

Lys Lys Lys Ala Met Gly Arg Thr Glu Ser Thr Val Pro He Phe Asn 
210 215 220 

Met Glu Asp He Glu Glu Gly Val Glu Gly Tyr Asp Asp Glu Arg Thr 
225 230 235 240 

Leu Leu Met Ser Gin Lys Ser Leu Glu Lys Arg Phe Gly Gin Ser Pro 
24 5 250 ~ 255 

Val Phe He Ala Ala Thr Phe Met Glu Gin Gly Gly He Pro Pro Ser 
2 60 265 270 

Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala He His Val He Ser Cys 
275 280 285 

Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys Glu He Gly Trp He Tyr 
290 295 300 



Gly Ser Val Thr Glu Asp He Leu Thr Gly Phe Lys Met His Ala Arq 
305 310 315 320 

Gly Trp He Ser He Tyr Cys Met Pro Pro Arg Pro Ala Phe Lys Gly 
325 330 335 

Ser Ala Pro He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp 
340 345 350 
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Ala Leu Gly Ser lie Glu He Phe Leu Ser Arg His Cys Pro Leu Trp 
355 360 ' 365 

Tyr Gly Tyr Asn Gly Lys Leu Lys Pro Leu Met Arg Leu Ala Tyr He 
370 375 380 

Asn Thr He Val Tyr Pro Phe Thr Ser He Pro Leu He Ala Tyr Cvs 
385 390 395 400 

Thr Leu Pro Ala Phe Cys Leu Leu Thr Asn Lys Phe He He Pro Glu 
405 410 415 

He Ser Asn Phe Ala Ser Met Trp Phe He Leu Leu Phe Val Ser He 
420 425 430 

Phe Thr Thr Ser He Leu Glu Leu Arg Trp Ser Gly Val Ser He Glu 
435 440 445 

Asp Trp Trp Arg Asn Glu Gin Phe Trp Val He Gly Gly Thr Ser Ala 
450 455 460 

His Leu Phe Ala Val Phe Gin Gly Leu Leu Lys Val Leu Ala Glv T ie 
465 470 475 480 

Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp 
485 490 495 

Phe Ala Glu Leu Tyr Val Phe Lys Trp Thr Ser Leu Leu He Pro Pro 
500 505 5io 

Thr Thr Val Leu He Val Asn Leu Val Gly He Val Ala Gly Val Ser 
515 520 525 

Tyr Ala lie Asn Ser Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys 
530 535 540 

Leu Phe Phe Ala lie Trp Val He Ala His Leu Tyr Pro Phe Leu Lys 
545 550 555 5 60 

Gly Leu Leu Gly Arg Gin Asn Arg Thr Pro Thr He Val He Val Trp 
565 570 575 

Ser Val Leu Leu Ala Ser lie Phe Ser Leu Leu Trp Val Arg lie Asd 
580 585 * 590 

Pro Phe Thr Ser Asp Ser Asn Lys Leu Thr Asn Gly Gin Cys Gly lie 
595 600 6 05 

Asn Cys 
610 

<210> 17 

<211> 2890 

<212> DNA 

<213> Glycine max 

<400> 17 

gcacgagctc cacccagtta atgatgcata tggcttgtgg ttgacatcag tcatctgtga 60 

aatatggttt gctgtatcat ggataatgga tcagtttcca aaatggtacc caatacagcg 120 

agaaacatac cttgatcgtc tgtcactcag gtatgaaaaa gaagggaagc catctgagtt 180 
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gtccagtgta gacgtctttg tcagtactgt tgatcccatg aaggaacctc cactgattac 240 
agcaaacact gttctatcta tccttgctgt tgattatcca gttgataaag ttgcatgcta 300 
tgtctcagat gatggtgctg ctatgcttac ttttgaagca ctgtctgaga catctgaatt 360 
tgctaggaga tgggttccat tttgtaagaa atacaatatt gagccccggg caccagaatg 420 
gtactttggt cagaagatgg actatctgaa aaataaagta cacccagcat ttgtcaggga 480 
aaggagagca atgaagaggg attatgaaga atttaaggtg aggattaaca gtttggtggc 540 
aacagcacaa aaggttcctg aggatggatg gaccatgcaa gatgggactc cttggcctgg 600 
aaataatgtg agggatcatc ctggcatgat tcaggtcttc cttgggcagg atggtgttcg 660 
tgatgttgaa ggaaatgagc taccccgctt ggtctacgtt tctagagaaa agaggccagg 720 
gtttgatcac cacaaaaagg ctggtgcaat gaatgctctg gtacgggctt cagcaattat 780 
cactaatgca ccctatcttc tgaatgttga ttgtgatcac tacattaaca atagcaaggc 840 
acttagagaa gctatgtgtt ttatgatgga tcctcaacta gggaaaaagg tttgctatgt 900 
gcaatttcct cagcgatttg atggaattga tagacatgat agatattcaa acagaaatgt 960 
tgtatttttc gatattaaca tgaaaggatt ggatgggata caaggtccaa tatatgtcgg 1020 
aactggatgt gttttcagaa ggtacgcact ttatggatat gatgcacctg ccaagaagaa 1080 
accaccgagc aaaacttgta actgttggcc aaagtggtgc tgcctatgtt gtggctctag 1140 
aaagaaaaag aatgccaata gtaagaagga gaaaaagagg aaggtgaagc acagtgaagc 1200 
atcaaagcag atacatgcac ttgaaaatat tgaggcgggg aatgaaggaa ccaacaatga 1260 
gaagacatcc aatctgactc aaacaaagtt ggagaagagg tttgaacagt ctccagtatt 1320 
tgtagcctcc acacttttgg atgatggtgg agttccacat ggcgtgagtc ctgcatcact 1380 
tttaaaagaa gccatccagg tcatcagttg tggttatgaa gacaaaacag aatggggaaa 1440 
agaagttggg tggatatatg gttctgtgac agaggatatc ttgactggat ttaaaatgca 1500 
ttgccatggt tggcggtctg tgtattgcat tcctaagcgg cctgcattta aggggtctgc 1560 
gcctatcaac ctttcagatc gtctgcacca agttcttcgg tgggctcttg ggtctgttga 1620 
gatttttttc agcagacatt gtccaatctg gtatggctat ggtggtggat tgaaattgtt 1680 
ggaacgattt tcctacatta actcggtcgt atatccctgg acttccctcc cattgcttgt 1740 
ctactgtact ctaccagcca tatgccttct gactggaaaa tttatcgtac ccgagactag 1800 
caactatgcc agtcttgtgt tcatggccct cttcatatcc attgcagcaa ctggcatcct 1860 
tgagatgcaa tggggcggtg ttagcataga cgactggtgg aggaacgaac agttttgggt 1920 
gatcggaggt gtttcttccc atctatttgc cctatttcag ggtttactga aggtcttggc 1980 
tggtgtgaac acaaacttca ctgtgacctc aaaagcagca gatgatggag aattctcaga 2040 
actctacata ttcaagtgga catcactctt gatccctcca atgactttac ttatcatgaa 2100 
tattgtcggg gtggttgtcg ggatctcaga tgccatcaac aatggttatg actcatgggg 2160 
acctctcttt ggtagattgt tctttgcatt gtgggtgatc ctccatctct accccttctt 2220 
gaaggggttg cttggaaaac aagatagaat gccaaccatt atattggttt ggtcaatcct 2280 
tctggcctcc atcttgactc tcatgtgggt cagaattaac ccgtttgtgt caagagacgg 2340 
ccccgtgtta gaaatttgtg gattgaattg cgacgagtcg tgaataaaga aaaactgaag 2400 
aagaagggtt agttattttt cagctacact gcagtcatgt tgaagaatgc agccaacaca 24 60 
tgcttcacaa agttgcacga attttcggat ggaagtttta tttttcgggt atgzagagac 2520 
taaagagagg aaggggaggg ggctgacaca ttgttacctt gtaatagggt tttttcattt 2580 
attctttgat tatattttct gtgggtttta gtgttattct ctttccagtt tcatgtatta 2640 
taagaaagag gcattgaatg ataaattatt ccctcttcaa aatgggggat cctcagtctc 2700 
aagaaattac ttggtcatat ttttaggtat gggtcttgtt ctgtttaaaa ccatttgtaa 2760 
taccgtcaaa actatggata ttcttgttcc tcagatgtgt ttttgtgttt tattatttaa 2820 
cactcaggaa ccttttggtt tgattcaatt attcaatgtt tggatggcac taaaaaaaaa 2880 
aaaaaaaaaa 289Q 

<210> 18 

<211> 793 

<212> PRT 

<213> Glycine max 

<400> 18 

His Glu Leu His Pro Val Asn Asp Ala Tyr Gly Leu Trp Leu Thr Ser 
1 5 10 1 15 

Val He Cys Glu lie Trp Phe Ala Val Ser Trp lie Met Asp Gin Phe 
20 25 * 30 
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Pro Lys Trp Tyr Pro He Gin Arg Giu Thr Tvr Leu Asp Arg Leu Ser 
35 40 45 

Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Glu Leu Ser Ser Val Asp 
50 55 60 

Val Phe Val Ser Thr Val Asp Pro Met Lys Glu Pro Pro Leu He Thr 
65 70 75 80 

Ala Asn Thr Val Leu Ser He Leu Ala Val Asp Tyr Pro Val Asp Lys 
85 90 95 

Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu 
100 105 no 

Ala Leu Ser Glu Thr Ser Glu Phe Ala Arg Arg Trp Val Pro Phe Cys 
115 120 125 

Lys Lys Tyr Asn He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gly Gin 
130 135 i4o 

Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg Glu 
I 45 150 155 - 16Q 

Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Arg He Asn 
165 170 175 

Ser Leu Val Ala Thr Ala Gin Lys Val Pro Glu Asp Gly Trp Thr Met 
180 185 190 

Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Val Arg Asp His Pro Gly 
195 200 205 

Met He Gin Val Phe Leu Gly Gin Asp Gly Val Arg Aso Val Glu Gly 
210 215 220 

Asn Giu Leu Pro Arg Leu Val Tyr Val Ser Ara Glu Lvs Arg Pro Gly 
225 230 235 " 240 

Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Ara Ala 
245 250 255 

Ser Ala He He Thr Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys Asp 
260 265 270 

His Tyr He Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe Met 
275 280 285 

Met Asp Pro Gin Leu Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin 
290 295 300 

Arg Phe Asp Gly lie Asp Arg His Asp Arg Tyr Ser Asn Arg Asn Val 
305 310 315 ^ 320 

Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly lie Gin Gly Pro 
325 330 335 

He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Tyr Ala Leu Tyr Gly 
340 345 350 
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Tyr Asp Ala Pro Ala Lys Lys Lys Pro Pro Ser Lys Thr Cys Asn Cys 
355 360 365 

Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Ser Arg Lys Lys Lys Asn 
370 375 380 

Ala Asn Ser Lys Lys Glu Lys Lys Arg Lys Val Lys His Ser Glu Ala 
385 390 395 400 

Ser Lys Gin He His Ala Leu Glu Asn He Glu Ala Gly Asn Glu Gly 
405 410 415 

Thr Asn Asn Glu Lys Thr Ser Asn Leu Thr Gin Thr Lys Leu Glu Lys 
420 425 430 

Arg Phe Gly Gin Ser Pro Val Phe Val Ala Ser Thr Leu Leu Asp Asp 
435 440 445 

Gly Gly Val Pro His Gly Val Ser Pro Ala Ser Leu Leu Lys Glu Ala 
450 455 460 

He Gin Val He Ser Cys Gly Tyr Glu Asp Lys Thr Glu TrD Gly Lys 
465 470 475 " 480 

Glu Val Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu Thr Gly 
485 490 495 

Phe Lys Met His Cys His Gly Trp Arg Ser Val Tyr Cys He Pro Lys 
500 505 510 

Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp Arg Leu 
515 520 525 

His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Phe Phe Ser 
530 535 540 

Arg His Cys Pro lie Trp Tyr Gly Tyr Gly Gly Gly Leu Lys Leu Leu 
545 550 555 560 

Glu Arg Phe Ser Tyr He Asn Ser Val Val Tyr Pro Trp Thr Ser Leu 
565 570 ~ 575 

Pro Leu Leu Val Tyr Cys Thr Leu Pro Ala He Cys Leu Leu Thr Gly 
580 585 1 590 

Lys Phe He Val Pro Glu He Ser Asn Tyr Ala Ser Leu Val Phe Met 
595 600 605 

Ala Leu Phe He Ser He Ala Ala Thr Gly He Leu Glu Met Gin Trp 
610 615 620 

Gly Gly Val Ser He Asp Asp Trp Trp Arg Asn Glu Gin Phe Trp Val 
625 630 635 640 

He Gly Gly Val Ser Ser His Leu Phe Ala Leu Phe Gin Gly Leu Leu 
645 650 655 

Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val Thr Ser Lys Ala 
660 665 670 
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Ala Asp Asp Gly Glu Phe Ser Glu Leu Tyr lie Phe Lys Trp Thr Ser 
675 680 685 

Leu Leu He Pro Pro Met Thr Leu Leu He Met Asn He Val Gly Val 
690 695 700 

Val Val Gly He Ser Asp Ala He Asn Asn Gly Tyr Asp Ser Trp Glv 
705 710 715 720 

Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val He Leu His Leu 
725 730 735 

Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gin Asp Arg Met Pro Thr 
? 40 745 750 

He He Leu Val Trp Ser He Leu Leu Ala Ser He Leu Thr Leu Met 
755 760 765 

Trp Val Arg He Asn Pro Phe Val Ser Arg Asp Gly Pro Val Leu Glu 
770 775 780 

He Cys Gly Leu Asn Cys Asp Glu Ser 
785 790 

<210> 19 

<211> 1742 

<212> DNA 

<213> Triticum aestivum 
<220> 

<221> unsure 

<222> (9) 



<220> 

<221> unsure 
<222> (271) 



<400> 19 

>wheatcong aagctatgtg cttcctaatg 
gtgcagttcc cacaaaggtt tgatggtatt 
actgtctttt ttgatattaa crtgaggggc 
ggaactggtt gtgttttcaa cagaacggct 
aagaagccag gtttcttggc atcattatgt 
aaaaggagct cagataagaa aaagtcgaac 
aatctcgaag acatagagga gggtgttgaa 
ctcatgtctc aaatgagctt agagaagaga 
actctgatgg aatatggtgg tgttcctcag 
gctatccatg tcataagttg tggctatgag 
tggatctatg gatctgtcac agaagatatt 
tggcgttcaa tctattgcat gcccaagcgc 
ctttcagatc gtctgaatca agtgctgcgg 
agccggcatt gccccttatg gtatggctac 
gcttacatca acaccaccat ttacccacta 
ttgcctgcta tctgtctgct cactggaaag 
agtatctggt tcattgcgct cttcctttca 
tggagtggtg ttggcattga cgagtggtgg 
atctctgccc atctgtttgc cgtctttcag 
accaacttca ctgtcacctc aaaggctaat 
atgttcaagt ggacgacgct tcttatccct 
ggtgtcgttq ctggtacctc ctacgccatc 



gatccaaacc taggtccgca agtctgttat 60 
gataggaatg atcgatatgc aaacaggaac 120 
cttgacggca ttcaaggacc agtttatgtg 180 
atctatggtt atgagccccc aattaaggcg 240 
nggggcaaga agaaggcaag caagtcaaag 300 
aagcatgtgg acagttctgt tccagtattc 360 
ggtgctgggt ttgatgatga gaaatcagtt 420 
tttggccagt cagcagcatt tgttgcctcc 480 
tcgtccactc cagaatctct tttgaaagaa 540 
gacaagtctg aatggggaac tgagattggt 600 
ctaactggat tcaagatgca cgcaagaggc 660 
ccagctttca agggatctgc ccccatcaat 720 
tgggctcttg gttctgttga aattcttttc 780 
ggagggcgcc tcaagttcct ggagagattc 840 
acctctctcc cgcttctagt ctattgtata 900 
ttcatcatgc cagagattag caacttggcc 960 
attttcgcca ctggtatcct tgagatgagg 1020 
aggaatgaac agttctgggt cattggaggt 1080 
ggtcttctga aggtgcttgc aggtatcgac 1140 
gatgaagaag gcgactttgc tgagctctac 1200 
ccgacgacca ttttgatcat taacatggtc 1260 
aacagtggtt accaatcatg ggggccgctc 1320 



31 



WO 00/04166 



PCT/US99/15871 



tttgggaagc tcttctttgc cttctgggtg 
cttatgggca ggcaaaaccg cacaccgacg 
tctatcttct ccttgctgtg ggttcgtgtt 
aatatccaaa cctgtggcat caactgctag 
taacagtgat cgagcgacca cctgtggagc 
actacgtttg agaaagttgt caaaattgag 
tatctaccgt tttcatgagg ttaagctctt 
aa 

<210> 20 
<211> 506 
<212> PRT 

<213> Triticum aestivum 



attgttcact tatacccatt cctcaagggt 1380 
attgtcatcg tctgggctgt cctcctcgct 1440 
gatccattca ctacccgtct cgctggccca 1500 
gaaagtggga gtttgtagag acagaaaata 1560 
cagagaatat ttatgttggg gttgtgaatt 1620 
aaaacacatt tgtaaataga tgtaatagac 1680 
cttttttgga aaaaaaaaaa aaaaaaaaaa 1740 

1742 



<220> 

<221> UNSURE 
<222> (88) 



<400> 20 

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Pro Gin Val Cys 
1 5 10 15 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly lie Asd Arg Asn Asp Arq 
20 25 "30 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp lie Asn Leu Arg Gly Leu 
35 40 45 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn 
50 55 so 

Arg Thr Ala He Tyr Gly Tyr Glu Pro Pro He Lys Ala Lys Lys Pro 
65 70 75 80 

Gly Phe Leu Ala Ser Leu Cys Xaa Gly Lys Lys Lys Ala Ser Lys Ser 
85 90 95 

Lys Lys Arg Ser Ser Asp Lys Lys Lys Ser Asn Lys His Val Asp Ser 
100 105 110 

Ser Val Pro Val Phe Asn Leu Glu Asp He Glu Glu Gly Val Glu Glv 
115 120 125 

Ala Gly Phe Asp Asp Glu Lys Ser Val Leu Met Ser Gin Met Ser Leu 
130 135 140 

Glu Lys Arg Phe Gly Gin Ser Ala Ala Phe Val Ala Ser Thr Leu Met 
145 150 155 160 

Glu Tyr Gly Gly Val Pro Gin Ser Ser Thr Pro Glu Ser Leu Leu Lys 
165 170 175 

Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp Lys Ser Glu Trp 
180 185 190 

Gly Thr Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu 
195 200 205 

Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser He Tyr Cys Met 
Z1U 2 15 220 
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Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp 
225 230 235 2 40 

Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Leu 
245 250 255 

Phe Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Arg Leu Lys 
260 265 "270 

Phe Leu Glu Arg Phe Ala Tyr He Asn Thr Thr He Tyr Pro Leu Thr 
275 280 285 

Ser Leu Pro Leu Leu Val Tyr Cys He Leu Pro Ala He Cys Leu Leu 
290 295 300 

Thr Gly Lys Phe He Met Pro Glu He Ser Asn Leu Ala Ser lie Trp 
305 310 315 320 

Phe He Ala Leu Phe Leu Ser He Phe Ala Thr Gly He Leu Glu Met 
325 330 335 

Arg Trp Ser Gly Val Gly He Asp Glu Trp Trp Arg Asn Glu Gin Phe 
340 345 350 

Trp Val He Gly Gly He Ser Ala His Leu Phe Ala Val Phe Gin Gly 
355 360 365 

Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr Val Thr Ser 
3"?0 375 380 

Lys Ala Asn Asp Glu Glu Gly Asp Phe Ala Glu Leu Tyr Met Phe Lvs 
385 390 395 4 00 

Trp Thr Thr Leu Leu He Pro Pro Thr Thr He Leu lie lie Asn Met 
405 410 415 

Val Gly Val Val Ala Gly Thr Ser Tyr Ala lie Asn Ser Gly Ty- Gin 
420 425 436 

Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val lie 
435 440 445 

Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg Gin Asn Ara 
450 455 46O 

Thr Pro Thr lie Val lie Val Trp Ala Val Leu Leu Ala Ser He Phe 
465 470 475 480 

Ser Leu Leu Trp Val Arg Val Asp Pro Phe Thr Thr Arg Leu Ala Gly 
485 490 495 

Pro Asn lie Gin Thr Cys Gly lie Asn Cys 
500 505 

<210> 21 
<211> 1029 
<212> DNA 

<213> Triticum aestivum 
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<400> 21 

gcacgagccg ctcctcacca acggccagat ggttgatgac atcccgccgg agcagcacgc 60 

gctcgtgccg tcctacatga gcggcggcgg cggcgggggc aagaggatcc acccgctccc 120 

tttcgcagat cccaaccttc cagtgcaacc gagatccatg gacccgtcca aggatctggc 180 

cgcctacgga tatggcagcg tggcctggaa ggagagaatg gagggctgga agcagaagca 24 0 

ggagcgcctg cagcatgtca ggagcgaggg tggcggtgat tgggatggcg acgatgcaga 300 

tctgccacta atggatgaag ctaggcagcc attgtccaga aaagtcccta tatcatcaag 360 

ccgaattaat ccctacagga tgattatcgt tatccggttg gtggttttgg gtttcttctt 420 

ccactaccga gtgatgcatc cggcgaaaga tgcatttgca ttgtggctca tatctgtaat 480 

ctgtgaaatc tggtttgcga tgtcctgtat tcttgatcag ttcccaaagt ggtttccaat 540 

cgagagagag acttacctgg accgtttgtc actaaggttt gacaaggaag gtcaaccctc 600 

tcagcttgct ccaatcgact tctttgtcag tacggttgat cccacaaagg aacctccctt 660 

ggtcacagcg aacactgtcc tttccatcct ttctgtggat tatccggttg agaaggtctc 720 

ctgctatgtt tctgatgatg gtgctgcaat gcttacgttt gaagcattgt ctgaaacatc 780 

tgaatttgca aagaaatggg ttcctttcag caaaaagttt aatatcgagc ctcgtgctcc 840 

tgagtggtac ttccaacaga agatagacta cctgaaagac aaggttgctg cttcatttgt 900 

tagggagagg agggcgatga agagagaata cgaggaattc aaggtaagga tcaatgcctt 960 

ggttgcaaaa gcccaaaagg ttcctgagga aggatggaca atgcaagatg gaagcccctg 1020 
gcctggaaa * 



1029 



<210> 22 
<211> 340 
<212> PRT 

<213> Triticum aestivum 
<400> 22 

Pro Leu Leu Thr Asn Gly Gin Met Val Asp Asp He Pro Pro Glu Gin 
15 10 is 

His Ala Leu Val Pro Ser Tyr Met Ser Gly Gly Gly Gly Gly Gly Lys 
20 25 30 

Arg He His Pro Leu Pro Phe Ala Asp Pro Asn Leu Pro Val Gin Pro 



35 40 



45 



Arg Ser Met Asp Pro Ser Lys Asp Leu Ala Ala Tyr Gly Tyr Gly Ser 



50 55 



60 



Val Ala Trp Lys Glu Arg Met Glu Gly Tro Lys Gin Lys Gin Glu Aro 
65 70 75 80 

Leu Gin His Val Arg Ser Glu Gly Gly Gly Asp Trp Asp Gly Asp Asp 
85 90 95 

Ala Asp Leu Pro Leu Met Asp Glu Ala Arg Gin Pro Leu Ser Arg Lys 



100 105 



110 



Val Pro He Ser Ser Ser Arg He Asn Pro Tyr Arg Met He He Val 
115 120 125 

He Arg Leu Val Val Leu Gly Phe Phe Phe His Tyr Arg Val Met His 
130 135 uo 

Pro Ala Lys Asp Ala Phe Ala Leu Trp Leu He Ser Val He Cys Glu 
145 150 155 i 6 o 

He Trp Phe Ala Met Ser Cys He Leu Asp Gin Phe Pro Lys Trp Phe 
165 170 175 
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Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe Asd 
180 18 5 190 

Lys Glu Gly Gin Pro Ser Gin Leu Ala Pro He Asd Phe Phe Val Ser 
195 200 * 205 

Thr Val Asp Pro Thr Lys Glu Pro Pro Leu Val Thr Ala Asn Thr Val 
210 215 2 20 

Leu Ser He Leu Ser Val Asp Tyr Pro Val Glu Lys Val Ser Cys Tvr 
225 230 235 2 40 

Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe Glu Ala Leu Ser Glu 
245 250 255 

Thr Ser Glu Phe Ala Lys Lys Trp Val- Pro Phe Ser Lys Lys Phe Asn 
260 265 270 

He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Gin Gin Lys He Asp Tvr 
275 280 285 

Leu Lys Asp Lys Val Ala Ala Ser Phe Val Arg Glu Arg Arg Ala Me* 
290 295 300 

Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala Leu Val Ala 
305 310 315 320 

Lys Ala Gin Lys Val Pro Glu Glu Gly Trp Thr Met Gin Asp Gly Ser 
325 330 335 

Pro Trp Pro Gly 
340 

<210> 23 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 23 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu 
15 10 is 

Leu Val Arg He Arg His Glu Ser Asp Glv Gly Thr Lys Pro Leu Lys 
20 25 ' 30 

Asn Met Asn Gly Gin He Cys Gin He Cys Gly Asp Asp Val Gly Leu 
35 40 45 

Ala Glu Thr Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Thr Gin Cys 
65 7 0 75 80 

Cys Pro Gin Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg 
85 90 " 95 

Val Glu Gly Asp Glu Asp Glu Asp Asp Val Asp Asp He Glu Asn Glu 
100 105 no 
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Phe Asn Tyr Ala Gin Gly Ala Asn Lys Ala Arg His Gin Arg His Gly 
115 120 125 

Glu Glu Phe Ser Ser Ser Ser Arg His Glu Ser Gin Pro lie Pro Leu 
130 135 140 

Leu Thr His Gly His Thr Val Ser Gly Glu He Arg Thr Pro Asp Thr 
145 150 155 160 

Gin Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn 
165 170 175. 

Ala lie Ser Ser Pro Tyr lie Asp Pro Arg Gin Pro Val Pro Val Arg 
180 185 190 

He Val Asp Pro Ser Lys Asp Leu Asn- Ser Tyr Gly Leu Gly Asn Val 
195 200 205 

Asp Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gin Glu Lys Asn 
210 215 220 

Met Leu Gin Met Thr Gly Lys Tyr His Glu Gly Lvs Gly Gly Glu He 
225 230 235 " 240 

Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gin Met Ala Asp Asp Thr 
245 250 255 

Arg Leu Pro Met Ser Arg Val Val Pro He Pro Ser Ser Arg Leu Thr 
260 265 270 

Pro Tyr Arg Val Val lie lie Leu Arg Leu He He Leu Cys Phe Phe 
2*75 280 285 

Leu Gin Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp 
290 295 300 

Leu Thr Ser Val He Cys Glu He Trp Phe Ala Phe Ser Trp Leu Leu 
305 310 315 320 

Asp Gin Phe Pro Lys Trp Tyr Pro He Asn Arg Glu Thr Tyr Leu Asp 
325 330 335 

Arg Leu Ala He Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gin Leu Val 
340 345 350 

Pro Val Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro 
355 360 365 

Leu Val Thr Ala Asn Thr Val Leu Ser He Leu Ser Val Asp Tyr Pro 
370 375 380 



Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu 
385 390 395 400 

Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu Phe Ala Lys Lys Trp Val 
405 410 415 

Pro Phe Cys Lys Lys Phe Asn He Glu Pro Arg Ala Pro Glu Phe Tyr 
420 425 430 
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Phe Ala Gin Lys He Asp Tyr Leu Lys Asp Lys He Gin Pro Ser Phe 
435 440 " 445 

Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val 
450 455 460 

Arg He Asn Ala Leu Val Ala Lys Ala Gin Lys He Pro Glu Glu Gly 
465 470 475 480 

Trp Thr Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp 
485 490 495 

His Pro Gly Met lie Gin Val Phe Leu Gly His Ser Gly Gly Leu Asp 
500 505 510 

Thr Asp Gly Asn Glu Leu Pro Arg Leu- He Tyr Val Ser Arg Glu Lys 
515 520 525 

Arg Pro Gly Phe Gin His His Lys Lys Ala Gly Ala Met Asn Ala Leu 
530 535 540 

He Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tvr Leu Leu Asn Val 
545 550 555 ' 560 

Asp Cys Asp His Tyr Phe Asn Asn Ser Lys Ala lie Lys Glu Ala Met 
565 570 575 

Cys Phe Met Met Asp Pro Ala He Gly Lys Lys Cys Cys Tyr Val Gin 
580 585 ~ 590 

Phe Pro Gin Arg Phe Asp Gly He Asp Leu His Asp Arg Tyr Ala Asn 
595 600 605 

Arg Asn He Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly He 
610 615 620 

Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gin Ala 
625 630 635 640 

Leu Tyr Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn 
645 650 655 

He He Val Lys Ser Cys Cys Gly Ser Arg Lys Lys Giy Lys Ser Ser 
660 665 670 

Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly lie Asn Arg Ser Asp Ser 
675 680 685 

Asn Ala Pro Leu Phe Asn Met Glu Asp lie Asp Glu Gly Phe Glu Gly 
690 695 700 

Tyr Asp Asp Glu Arg Ser He Leu Met Ser Gin Arg Ser Val Glu Lys 
705 710 715 ' 720 

Arg Phe Giy Gin Ser Pro Val Phe lie Ala Ala Thr Phe Met Glu Gin 
725 730 735 

Gly Gly lie Pro Pro Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala 
^40 745 750 
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lie His Val He Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp Gly Lys 
755 760 765 

Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu Thr Glv 
770 775 780 

Phe Lys Met His Ala Arg Gly Trp He Ser He Tyr Cys Asn Pro Pro 
785 7 90 795 800 

Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp Arg Leu 
805 810 815 

Asn Gin Val Leu Arg Trp Ala Leu Gly Ser He Glu He Leu Leu Ser 
820 825 830 

Arg His Cys Pro He Trp Tyr Gly Tyr His Gly Arg Leu Arg Leu Leu 
835 840 " 845 

Glu Arg He Ala Tyr He Asn Thr He Val Tyr Pro He Thr Ser He 
850 855 860 

Pro Leu He Ala Tyr Cys He Leu Pro Ala Phe Cys Leu He Thr Asd 
865 87 ° 875 880 

Arg Phe He He Pro Glu He Ser Asn Tyr Ala Ser He Trp Phe He 
885 890 895 

Leu Leu Phe He Ser He Ala Val Thr Gly He Leu Glu Leu Arq Trp 
900 905 910 

Ser Gly Val Ser lie Glu Asp Trp Trp Arg Asn Glu Gin Phe Trp Val 
915 920 925 

He Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gin Gly Leu Leu 
930 935 940 

Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr Val Thr Ser Lys Ala 
945 950 955 960 

Thr Asp Glu Asp Gly Asp Fhe Ala Glu Leu Tvr He Phe Lys Trp Thr 
965 970 975 

Ala Leu Leu He Pro Pro Thr Thr Val Leu Leu Val Asn Leu He Glv 
980 985 990 

He Val Ala Gly Val Ser Tyr Ala Val Asn Ser Gly Tyr Gin Ser Trp 
995 1000 1005 

Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Leu Trp Val He Ala His 
1010 1015 1020 

Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Arg Gin Asn Arg Thr Pro 
1025 1030 1035 1040 

Thr He Val He Val Trp Ser Val Leu Leu Ala Ser He Phe Ser Leu 
1045 1050 10 55 

Leu Trp Val Arg He Asn Pro Phe Val Asp Ala Asn Pro Asn Ala Asn 
1060 1065 1070 



38 



WO 00/04166 



PCT/US99/15871 



Asn Phe Asn Gly Lys Gly Gly Val Phe 

1080 





1075 


<210> 


24 


<211> 


1084 


<212> 


PRT 


<213> 


Arabidopsis thaliana 


<400> 


24 


Met Asn 


Thr Gly Gly Arg Leu 


1 


5 



10 " 15 

Phe Val Leu lie Asn Ala Asp Glu Ser Ala Arg He Arg Ser Val Gin 
20 25 30 

Glu Leu Ser Gly Gin Thr Cys Gin He Cys Gly Asp Glu He Glu Leu 
35 40 45 

Thr Val Ser Ser Glu Leu Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 
50 55 60 

Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gin Ala 
65 70 75 " 80 

Cys Pro Gin Cys Lys Thr Arg Tyr Lys Arg He Lys Gly Ser Pro Arg 
85 90 95 

Val Asp Gly Asp Asp Glu Glu Glu Glu Asp He Asp Asp Leu Glu Tyr 
100 105 no 

Glu Phe Asp His Gly Met Asp Pro Glu His Ala Ala Glu Ala Ala Leu 
115 120 125 

Ser Ser Arg Leu Asn Thr Gly Arg Gly Gly Leu Asp Ser Ala Pro Pro 
130 135 140 

Gly Ser Gin He Pro Leu Leu Thr Tyr Cys Asp Glu Asp Ala Aso Met 
145 150 155 * 160 

Tyr Ser Asp Arg His Ala Leu He Val Pro Pro Ser Thr Gly Tyr Gly 
165 170 175 

Asn Arg Val Tyr Pro Ala Pro Phe Thr Asp Ser Ser Ala Pro Pro Gin 
180 185 190 

Ala Arg Ser Met Val Pro Gin Lys Asp lie Ala Glu Tyr Gly Tyr Gly 
195 200 205 

Ser Val Ala Trp Lys Asp Arg Met Glu Val Trp Lys Arg Arg Gin Gly 
210 215 220 

Glu Lys Leu Gin Val He Lys His Glu Gly Gly Asn Asn Gly Arg Gly 
225 230 235 240 

Ser Asn Asp Asp Asp Glu Leu Asp Asp Pro Asp Met Pro Met Met Asp 
24 5 250 255 

Glu Gly Arg Gin Pro Leu Ser Arg Lys Leu Pro lie Arg Ser Ser Arg 
260 265 270 
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He Asn Pro Tyr Arg Met Leu He Leu Cys Arg Leu Ala lie Leu 



275 



280 



285 



Gly 



Leu Phe Phe His Tyr Arg lie Leu His Pro Val Asn Asp Ala Tyr Glv 

290 one J 



295 



300 



Leu Trp Leu Thr Ser Val lie Cys Glu lie Trp Phe Ala Val Ser Trp 
305 310 315 320 

He Leu Asp Gin Phe Pro Lys Trp Tyr Pro lie Glu Arg Glu Thr Tyr 
325 330 " 335 

Leu Asp Arg Leu Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Glv 
340 345 350 

Leu Ala Pro Val Asp Val Phe Val Ser- Thr Val Asp Pro Leu Lys Glu 
3 *5 360 3 6 5 

Pro Pro Leu He Thr Ala Asn Thr Val Leu Ser He Leu Ala Val Asd 
370 375 3 8 o F 



Tyr Pro Val Asp Lys Val Ala Cys Tyr Val Ser Aso Asd Gly Ala Al 
385 390 395 



a 

400 



Met Leu Thr Phe Glu Ala Leu Ser Asp Thr Ala Glu Phe Ala Arq Lys 
405 4 io 415 

Trp Val Pro Phe Cys Lys Lys Phe Asn He Glu Pro Arg Ala Pro Glu 
420 425 430 

Trp Tyr Phe Ser Gin Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro 
435 440 445 

Ala Phe Val Arg Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe 
450 455 460 

Lys Val Lys He Asn Ala Leu Val Ala Thr Ala Gin Lvs Val Pro Glu 
465 47 0 475 ' 480 

Glu Gly Trp Thr Met Gin Asp Gly Thr Pro Tro Pro Gly Asn Asn Val 
485 490 " 495 

Arg Asp His Pro Gly Met He Gin Val Phe Leu Gly His Ser Gly Val 
500 505 5io 

Arg Asp Thr Asp Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arq 
515 520 525 

Glu Lys Arg Pro Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn 
530 535 540 

Ser Leu lie Arg Val Ser Ala Val Leu Ser Asn Ala Pro Tyr Leu Leu 
545 550 555 1 5 6 o 

Asn Val Asp Cys Asp His Tyr He Asn Asn Ser Lys Ala He Arg Glu 
565 570 575 

Ser Met Cys Phe Met Met Asp Pro Gin Ser Gly Lys Lys Val Cys Tyr 
580 585 590 
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Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg His Asp Arg Tyr 
595 600 605 

Ser Asn Arg Asn Val Val Phe Phe Asp lie Asn Met Lys Gly Leu Asp 
610 615 620 

Gly He Gin Gly Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arq 
625 630 635 640 

Gin Ala Leu Tyr Gly Phe Asp Ala Pro Lys Lys Lys Lys Pro Pro Gly 
645 650 655 

Lys Thr Cys Asn Cys Trp Pro Lys Trp Cys Cys Leu Cys Cys Gly Leu 
660 665 670 

Arg Lys Lys Ser Lys Thr Lys Ala Lys Asp Lys Lys Thr Asn Thr Lys 
675 680 685 

Glu Thr Ser Lys Gin He His Ala Leu Glu Asn Val Asp Glu Gly Val 
690 695 700 

He Val Pro Val Ser Asn Val Glu Lys Arg Ser Glu Ala Thr Gin Leu 
705 710 715 720 

Lys Leu Glu Lys Lys Phe Gly Gin Ser Pro Val Phe Val Ala Ser Ala 
725 730 735 

Val Leu Gin Asn Gly Gly Val Pro Arg Asn Ala Ser Pro Ala Cys Leu 
7 ^0 745 750 

Leu Arg Glu Ala He Gin Val He Ser Cys Gly Tyr Glu Asp Lys Thr 
755 760 765 

Glu Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp 
770 775 780 



He Leu Thr Gly Phe Lys Met His Cys His Gly Trp Ara Ser Val Tyr 
785 790 795 



00 



Cys Met Pro Lys Arg Ala Ala Phe Lys Gly Ser Ala Pro He Asn Leu 
805 810 815 

Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu 
820 825 830 

He Phe Leu Ser Arg His Cys Pro He Trp Tyr Gly Tyr Gly Glv Glv 
835 840 845 

Leu Lys Trp Leu Glu Arg Phe Ser Tyr He Asn Ser Val Val Tyr Pro 
850 855 860 

Trp Thr Ser Leu Pro Leu He Val Tyr Cys Ser Leu Pro Ala Val Cys 
865 870 875 880 

Leu Leu Thr Gly Lys Phe He Val Pro Glu He Ser Asn Tyr Ala Gly 
885 890 895 

He Leu Phe Met Leu Met Phe He Ser He Ala Val Thr Gly lie Leu 
900 905 910 
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Glu Met Gin Trp Gly Gly Val Gly He Asp Asp Trp Tro Arg Asn Glu 
915 920 925 

Gin Phe Trp Val He Gly Gly Ala Ser Ser His Leu Phe Ala Leu Phe 
930 935 940 

Gin Gly Leu Leu Lys Val Leu Ala Gly Val Asn Thr Asn Phe Thr Val 
945 950 955 960 

Thr Ser Lys Ala Ala Asp Asp Gly Ala Phe Ser Glu Leu Tyr He Phe 
965 970 975 

Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu He He Asn 
980 985 990 

He He Gly Val He Val Gly Val Ser- Asp Ala He Ser Asn Gly Tyr 
995 1000 1005 

Asp Ser Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val 
1010 1015 1020 

He Val His Leu Tyr Pro Phe Leu Lys Gly Met Leu Gly Lys Gin Asp 
1025 1030 1035 ' 1040 

Lys Met Pro Thr He He Val Val Trp Ser He Leu Leu Ala Ser lie 
1045 1050 1055 

Leu Thr Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly 
1060 1065 1070 

Pro Val Leu Glu He Cys Gly Leu Asn Cys Gly Asn 
1075 1080 

<210> 25 
<211> 685 
<212> PRT 

<213> Gossypium hirsutum 
<400> 25 

Arg Arg Trp Val Pro Phe Cys Lys Lys His Asn Val Glu Pro Arq Ala 
15 10 15 

Pro Glu Phe Tyr Phe Asn Glu Lys He Asp Tyr Leu Lys Asp Lys Val 
20 25 * 30 

His Pro Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu 
35 40 45 

Glu Phe Lys Val Arg He Asn Ala Leu Val Ala Lys Ala Gin Lys Lvs 
50 55 60 

Pro Glu Glu Gly Trp Val Met Gin Asp Gly Thr Pro Trp Pro Glv Asn 
65 ™ 75 bo 

Asn Thr Arg Asp His Pro Gly Met He Gin Val Tyr Leu Gly Ser Ala 
85 90 95 

Gly Ala Leu Asp Val Asp Gly Lys Glu Leu Pro Arg Leu Val Tyr Val 
100 105 no 
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Ser Arg Glu Lys Arg Pro Gly Tyr Gin His His Lys Lys Ala Gly Ala 
115 120 

Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Ala Pro Phe 
130 135 140 

lie Leu Asn Leu Asp Cys Asp His Tyr lie Asn Asn Ser Lys Ala Met 
145 "0 155 leo 

Arg Glu Ala Met Cys Phe Leu Met Asp Pro Gin Phe Gly Lys Lys Leu 
165 170 175 

Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly lie Asp Arg His Asp 
180 185 190 

Arg Tyr Ala Asn Arg Asn Val Val Phe Phe Asp He Asn Met Leu Gly 
195 200 205 

Leu Asp Gly Leu Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe 
210 215 220 

Asn Arg Gin Ala Leu Tyr Gly Tyr Asp Pro Pro Val Ser Glu Lys Arq 

225 o-an ^ 



230 



235 



240 



Pro Lys Met Thr Cys Asp Cys Trp Pro Ser Trp Cys Cys Cys Cys Cys 
245 250 ' 255 

Gly Gly Ser Arg Lys Lys Ser Lys Lys Lys Gly Glu Lys Lys Gly Leu 
260 265 270 

Leu Gly Gly Leu Leu Tyr Gly Lys Lys Lys Lys Met Met Gly Lys Asn 
275 280 285 

Tyr Val Lys Lys Gly Ser Ala Pro Val Phe Asp Leu Glu Glu He Glu 
290 295 300 

Glu Gly Leu Glu Gly Tyr Glu Glu Leu Glu Lys Ser Thr Leu Met Ser 
305 310 315 320 

Gin Lys Asn Phe Glu Lys Arg Phe Gly Gin Ser Pro Val Phe He Ala 
325 330 335 

Ser Thr Leu Met Glu Asn Gly Gly Leu Pro Glu Gly Thr Asn Ser Thr 
340 345 " 350 

Ser Leu lie Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu Glu 
355 360 3 6 5 

Lys Thr Glu Trp Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr 
370 375 3 8 o 

Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Lys Ser 
385 390 395 40Q 

Val Tyr Cys Val Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro lie 
405 410 415 

Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser 
4 20 425 1 430 
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Val Glu He Phe Leu Ser Arg His Cys Pro Leu Trp Tyr Gly Tyr Gly 
435 440 445 

Gly Lys Leu Lys Trp Leu Glu Arg Leu Ala Tyr He Asn Thr He Val 
450 455 460 

Tyr Pro Phe Thr Ser He Pro Leu Leu Ala Tyr Cys Thr He Pro Ala 
465 470 475 480 

Val Cys Leu Leu Thr Gly Lys Phe He He Pro Thr Leu Ser Asn Leu 
485 490 495. 

Thr Ser Val Trp Phe Leu Ala Leu Phe Leu Ser He He Ala Thr Gly 
500 505 510 

Val Leu Glu Leu Arg Trp Ser Gly Val Ser He Gin Asp Trp Trp Arq 
515 520 525 

Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala 
530 535 540 

Val Phe Gin Gly Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe 
545 550 555 560 

Thr Val Thr Ala Lys Ala Ala Asp Asp Thr Glu Phe Gly Glu Leu Tyr 
565 570 J 575 

Leu Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu He He 
580 585 590 

Leu Asn Met Val Gly Val Val Ala Gly Val Ser Asp Ala He Asn Asn 
595 600 605 

Gly Tyr Gly Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe 
610 615 620 

Trp Val He Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arq 
625 630 6 35 640 

Gin Asn Arg Thr Pro Thr He Val Val Leu Trp Ser He Leu Leu Ala 
645 650 655 

Ser He Phe Ser Leu Val Trp Val Arg He Asp Pro Phe Leu Pro Lys 
• 660 665 670 

Gin Thr Gly Pro Val Leu Lys Gin Cys Gly Val Glu Cys 
675 680 685 

<210> 26 
<211> 1111 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 26 

Met Ala Ser Thr Pro Pro Gin Thr Ser Lys Lys Val Arg Asn Asn Ser 
15 10 15 

Gly Ser Gly Gin Thr Val Lys Phe Ala Arg Arg Thr Ser Ser Gly Arq 
20 25 30 
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Tyr Val Ser Leu Ser Arg Asp Asn He Glu Leu Ser Glv Glu Leu Ser 
35 40 45 

Gly Asp Tyr Ser Asn Tyr Thr Val His He Pro Pro Thr Pro Asp Asn 
50 55 60 

Gin Pro Met Ala Thr Lys Ala Glu Glu Gin Tyr Val Ser Asn Ser Leu 
65 70 75 eo 

Phe Thr Gly Gly Phe Asn Ser Val Thr Arg Ala His Leu Met Asp Lys 
85 90 95 

Val He Asp Ser Asp Val Thr His Pro Gin Met Ala Gly Ala Lys Gly 
100 105 no 

Ser Ser Cys Ala Met Pro Ala Cys Asp Gly Asn Val Met Lys Asp Glu 
115 120 125 

Arg Gly Lys Asp Val Met Pro Cys Glu Cys Arg Phe Lys He Cys Arg 
130 135 140 

Asp Cys Phe Met Asp Ala Gin Lys Glu Thr Glv Leu Cvs Pro Giv Cvs 
145 150 155 ' ' 160 

Lys Glu Gin Tyr Lys He Gly Asp Leu Asp Asp Asp Thr Pro Asp Tyr 
165 170 * 175 

Ser Ser Gly Ala Leu Pro Leu Pro Ala Pro Gly Lys Asp Gin Arg Gly 
180 185 190 

Asn Asn Asn Asn Met Ser Met Met Lys Arg Asn Gin Asn Gly Glu Phe 
195 200 205 

Asp His Asn Arg Trp Leu Phe Glu Thr Gin Gly Thr Tyr Gly Tyr Glv 
210 215 220 

Asn Ala Tyr Trp Pro Gin Asp Glu Met Tyr Gly Asd Asd Met Aso Glu 
225 230 235 ' * * 240 

Gly Met Arg Gly Gly Met Val Glu Thr Ala Asp Lys Pro Trp Arg Pro 
245 250 255 

Leu Ser Arg Arg He Pro He Pro Ala Ala lie He Ser Pro Tyr Arg 
260 265 270 

Leu Leu lie Val He Arg Phe Val Val Leu Cys Phe Phe Leu Thr Trp 
275 280 285 

Arg lie Arg Asn Pro Asn Glu Asp Ala He Trp Leu Trp Leu Met Ser 
290 295 300 



lie lie Cys Glu Leu Trp Phe Gly Phe Ser Trp lie Leu Asp Gin lie 

305 310 315 320 

Pro Lys Leu Cys Pro lie Asn Arg Ser Thr Asp Leu Glu Val Leu Arg 
325 330 335 

Asp Lys Phe Asp Met Pro Ser Pro Ser Asn Pro Thr Gly Arg Ser Asp 
340 345 350 
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Leu Pro Gly He Asp Leu Phe Val Ser Thr Ala Asp Pro Glu Lvs Glu 
355 360 365 

Pro Pro Leu Val Thr Ala Asn Thr lie Leu Ser He Leu Ala Val Asp 
370 375 380 

Tyr Pro Val Glu Lys Val Ser Cys Tyr Leu Ser Asp Asp Gly Gly Ala 
385 390 395 400 

Leu Leu Ser Phe Glu Ala Met Ala Glu Ala Ala Ser Phe Ala Asp Leu 
405 410 415 

Trp Val Pro Phe Cys Arg Lys His Asn He Glu Pro Arg Asn Pro Asp 
420 425 430 

Ser Tyr Phe Ser Leu Lys He Asp Pro Thr Lys Asn Lys Ser Arg He 
435 440 445 

Asp Phe Val Lys Asp Arg Arg Lys He Lys Arg Glu Tyr Asp Glu Phe 
450 455 460 

Lys Val Arg He Asn Gly Leu Pro Asp Ser He Arg Arg Arg Ser Asd 
465 470 475 " 48 q 

Ala Phe Asn Ala Arg Glu Glu Met Lys Ala Leu Lys Gin Met Arg Glu 
485 490 495 

Ser Gly Gly Asp Pro Thr Glu Pro Val Lys Val Pro Lys Ala Thr Trp 
500 505 510 

Met Ala Asp Gly Thr His Trp Pro Gly Thr Trp Ala Ala Ser Thr Arg 
515 520 525 

Glu His Ser Lys Gly Asp His Ala Gly He Leu Gin Val Met Leu Lvs 
530 535 540 

Pro Pro Ser Ser Asp Pro Leu He Gly Asn Ser Asp Asp Lys Val He 
545 550 555 * J 560 

Asp Phe Ser Asp Thr Asp Thr Arg Leu Pro Met Phe Val Tyr Val Ser 
565 570 575 

Arg Glu Lys Arg Pro Gly Tyr Asp His Asn Lys Lys Ala Gly Ala Met 
580 585 590 

Asn Ala Leu Val Arg Ala Ser Ala lie Leu Ser Asn Gly Pro Phe He 
595 600 605 

Leu Asn Leu Asp Cys Asp His Tyr He Tyr Asn Cys Lys Ala Val Arq 
610 615 - 620 



Glu Gly Met Cys Phe Met Met Asp Arg Gly Gly Glu Asp He Cys Tyr 

625 630 635 640 

He Gin Phe Pro Gin Arg Phe Glu Gly He Asp Pro Ser Asp Arg Tyr 

645 650 655 

Ala Asn Asn Asn Thr Val Phe Phe Asp Gly Asn Met Arg Ala Leu Asp 

660 665 670 
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Gly Val Gin Gly Pro Val Tyr Val Gly Thr Gly Thr Met Phe Arq Ara 
675 680 685 

Phe Ala Leu Tyr Gly Phe Asp Pro Pro Asn Pro Asp Lys Leu Leu Glu 
690 695 700 

Lys Lys Glu Ser Glu Thr Glu Ala Leu Thr Thr Ser Asd Phe Asp Pro 
705 710 715 * 720 

Asp Leu Asp Val Thr Gin Leu Pro Lys Arg Phe Gly Asn Ser Thr Leu 
725 730 735 

Leu Ala Glu Ser He Pro He Ala Glu Phe Gin Gly Arg Pro Leu Ala 
7 40 745 750 

Asp His Pro Ala Val Lys Tyr Gly Arg Pro Pro Gly Ala Leu Arg Val 
755 760 765 

Pro Arg Asp Pro Leu Asp Ala Thr Thr Val Ala Glu Ser Val Ser Val 
770 775 780 

He Ser Cys Trp Tyr Glu Asp Lys Thr Glu Trp Glv Asp Arg Val Glv 
785 790 795 ' 800 

Trp He Tyr Gly Ser Val Thr Glu Asp Val Val Thr Gly Tyr Arg Met 
805 810 815 

His Asn Arg Gly Trp Arg Ser Val Tyr Cys lie Thr Lys Arg Asp Ser 
820 825 830 

Phe Arg Gly Ser Ala Pro He Asn Leu Thr Asp Arg Leu His Gin Val 
835 840 845 

Leu Arg Trp Ala Thr Gly Ser Val Glu He Phe Phe Ser Arg Asn Asn 
850 855 860 

Ala He Leu Ala Ser Lys Arg Leu Lys Phe Leu Gin Arg Leu Ala Tyr 
865 870 875 880 

Leu Asn Val Gly lie Tyr Pro Phe Thr Ser Leu Phe Leu He Leu Tyr 
885 890 895 

Cys Phe Leu Pro Ala Phe Ser Leu Phe Ser Gly Gin Phe He Val Arg 
900 905 910 

Thr Leu Ser He Ser Phe Leu Val Tyr Leu Leu Met lie Thr lie Cys 
915 920 925 

Leu lie Gly Leu Ala Val Leu Glu Val Lys Trp Ser Gly lie Gly Leu 
930 935 940 



Glu Glu Trp Trp Arg Asn Glu Gin Trp Trp Leu lie Ser Gly Thr Ser 
945 . 950 955 960 

Ser His Leu Tyr Ala Val Val Gin Gly Val Leu Lys Val lie Ala Gly 
965 970 975 

lie Glu lie Ser Phe Thr Leu Thr Thr Lys Ser Gly Gly Asp Asp Asn 
980 985 - 990 



47 



WO 00/04166 



PCT/US99/15871 



Glu Asp He Tyr Ala Asp Leu Tyr He Val Lys Trp Ser Ser Leu Met 
995 1000 1005 

He Pro Pro He Val He Ala Met Val Asn He He Ala He Val Val 
1010 1015 1020 

Ala Phe He Arg Thr He Tyr Gin Ala Val Pro Gin Trp Ser Lys Leu 
1025 1030 1035 1040 

He Gly Gly Ala Phe Phe Ser Phe Trp Val Leu Ala His Leu Tyr Pro 
1045 1050 1055 

Phe Ala Lys Gly Leu Met Gly Arg Arg Gly Lys Thr Pro Thr He Val 
1060 1065 1070 

Phe Val Trp Ala Gly Leu He Ala He Thr He Ser Leu Leu Trp Thr 
1075 1080 1085 

Ala He Asn Pro Asn Thr Gly Pro Ala Ala Ala Ala Glu Gly Val Gly 
1090 1095 iioo 

Gly Gly Gly Phe Gin Phe Pro 
1105 1110 

<210> 27 
<211> 1026 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glu 
15 10 is 

Leu Val Val He His Asn His Glu Glu Pro Lys Pro Leu Lys Asn Leu 
20 25 30 

Asp Gly Gin Phe Cys Glu He Cys Gly Asp Gin He Gly Leu Thr Val 
35 40 45 

Glu Gly Asp Leu Phe Val Ala Cys Asn Glu Cys Gly Phe Pro Ala Cvs 
50 55 6 o 

Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Thr Gin Asn Cys Pro 
65 70 75 8Q 

Gin Cys Lys Thr Arg Tyr Lys Arg Leu Arg Gly Ser Pro Arg Val Glu 
85 90 95 

Gly Asp Glu Asp Glu Glu Asp He Asp Asp He Glu Tyr Glu Phe Asn 
100 105 no 

He Glu His Glu Gin Asp Lys His Lys His Ser Ala Glu Ala Met Leu 
115 120 125 

Tyr Gly Lys Met Ser Tyr Gly Arg Gly Pro Glu Asp Asp Glu Asn Gly 
130 135 140 

Arg Phe Pro Pro Val He Ala Gly Gly His Ser Gly Glu Phe Pro Val 
i45 150 155 160 
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Gly Gly Gly Tyr Gly Asn Gly Glu His Gly Leu His Lys Arg Val His 
165 170 175 

Pro Tyr Pro Ser Ser Glu Ala Gly Ser Glu Gly Gly Trp Arg Glu Arg 
180 185 190 

Met Asp Asp Trp Lys Leu Gin His Gly Asn Leu Gly Pro Glu Pro Asp 
195 200 205 

Asp Asp Pro Glu Met Gly Leu He Asp Glu Ala Arg Gin Pro Leu Ser 
210 215 220 

Arg Lys Val Pro He Ala Ser Ser Lys He Asn Pro Tyr Arg Met Val 
225 230 235 240 

He Val Ala Arg Leu Val He Leu Ala Val Phe Leu Arg Tyr Arg Leu 
245 250 255 

Leu Asn Pro Val His Asp Ala Leu Gly Leu Trp Leu Thr Ser Val He 
260 265 * 270 

Cys Glu He Trp Phe Ala Val Ser Trp He Leu Asp Gin Phe Pro Lys 
275 280 285 

Trp Phe Pro lie Glu Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg 
290 295 300 

Tyr Glu Arg Glu Gly Glu Pro Asn Met Leu Ala Pro Val Asp Val Phe 
305 310 315 320 

Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu Val Thr Ser Asn 
325 330 335 

Thr Val Leu Ser He Leu Ala Met Asp Tyr Pro Val Glu Lys He Ser 
340 345 350 

Cys Tyr Val Ser Asp Asp Gly Ala Ser Met Leu Thr Phe Glu Ser Leu 
355 360 365 

Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys 
37 0 375 380 

Phe Ser He Glu Pro Arg Ala Pro Glu Met Tyr Phe Thr Leu Lys Val 
385 390 395 400 

Asp Tyr Leu Gin Asp Lys Val His Pro Thr Phe Val Lys Glu Arg Arg 
405 410 415 

Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg lie Asn Ala Gin 
420 425 " 430 

Val Ala Lys Ala Ser Lys Val Pro Leu Glu Gly Trp He Met Gin Asp 
435 440 445 

Gly Thr Pro Trp Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He 
450 455 460 



Gin Val Phe Leu Gly His Ser Gly Gly Phe Asp Val Glu Gly His Glu 
465 470 475 ' 480 
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Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Gin 
485 490 495 

His His Lys Lys Ala Gly Ala Met Asn Ala Leu Val Arg Val Ala Gly 
500 505 510 

Val Leu Thr Asn Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr 
515 520 525 

Val Asn Asn Ser Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp 
530 535 540 

Pro Gin He Gly Lys Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe 
545 550 555 5 60 

Asp Gly He Asp Thr Asn Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe 
565 570 575 

Phe Asp He Asn Met Lys Gly Leu Asp Gly He Gin Gly Pro Val Tyr 
580 585 590 

Val Gly Thr Gly Cys Val Phe Lys Arg Gin Ala Leu Tyr Gly Tyr Glu 
595 600 605 

Pro Pro Lys Gly Pro Lys Arg Pro Lys Met He Ser Cys Gly Cys Cys 
610 615 620 

Pro Cys Phe Gly Arg Arg Arg Lys Asn Lys Lys Phe Ser Lys Asn Asp 
625 630 635 640 

Met Asn Gly Asp Val Ala Ala Leu Gly Gly Ala Glu Gly Asp Lys Glu 
645 650 655 

His Leu Met Phe Glu Met Asn Phe Glu Lys Thr Phe Gly Gin Ser Ser 
660 665 670 

He Phe Val Thr Ser Thr Leu Met Glu Glu Gly Gly Val Pro Pro Ser 
675 680 685 

Ser Ser Pro Ala Val Leu Leu Lys Glu Ala He His Val He Ser Cys 
690 695 700 



Gly Tyr Glu Asp Lys Thr Glu Trp Gly Thr Glu Leu Gly Trp He Tyr 

7 05 710 715 720 

Gly Ser He Thr Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg 
7 25 730 735 

Gly Trp Arg Ser He Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly 
740 745 750 

Ser Ala Pro He Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp 
755 760 765 

Ala Leu Gly Ser Val Glu He Phe Phe Ser Arg His Ser Pro Leu Trp 



770 



775 



780 



Tyr Gly Tyr Lys Gly Gly Lys Leu Lys Trp Leu Glu Arg Phe Ala Tyr 
785 790 795 soo 
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Ala Asn Thr Thr He Tyr Pro Phe Thr Ser He Pro Leu Leu Ala Tyr 
805 810 815 

Cys He Leu Pro Ala He Cys Leu Leu Thr Asp Lys Phe He Met Pro 
820 825 ' " 830 

Pro He Ser Thr Phe Ala Ser Leu Phe Phe He Ser Leu Phe Met Ser 
835 840 845 

He He Val Thr Gly He Leu Glu Leu Arg Trp Ser Gly Val Ser He 
850 855 860 

Glu Glu Trp Trp Arg Asn Glu Gin Phe Trp Val He Gly Gly He Ser 
865 870 875 880 

Ala His Leu Phe Ala Val Val Gin Gly Leu Leu Lys He Leu Ala Gly 
885 890 895 

He Asp Thr Asn Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Asp Asp 
900 905 910 

Phe Gly Glu Leu Tyr Ala Phe Lys Trp Thr Thr Leu Leu He Pro Pro 
915 920 925 

Thr Thr Val Leu He He Asn He Val Gly Val Val Ala Gly He Ser 
930 935 940 

Asp Ala He Asn Asn Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys 
945 950 955 960 

Leu Phe Phe Ser Phe Trp Val He Val His Leu Tyr Pro Phe Leu Lys 
965 970 975 

Gly Leu Met Gly Arg Gin Asn Arg Thr Pro Thr He Val Val He Trp 
980 985 990 

Ser Val Leu Leu Ala Ser He Phe Ser Leu Leu Trp Val Arg He Asp 
995 1000 1005 

Pro Phe Val Leu Lys Thr Lys Gly Pro Asp Thr Ser Lys Cys Gly He 
1010 1015 1020 

Asn Cys 
1025 

<210> 28 
<211> 701 
<212> PRT 

<213> Gossypium hirsutum 
<400> 28 

Asp Tyr Pro Val Glu Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala 
1 5 10 15 

Ala Met Leu Thr Phe Glu Ala Leu Ser Glu Thr Ser Glu Phe Ala Arq 
20 25 30 

Lys Trp Val Pro Phe Cys Lys Lys Tyr Asn He Glu Pro Arg Ala Pro 
35 40 45 
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Glu Trp Tyr Phe Ala Gin Lys He Asp Tyr Leu Lys Asp Lys Val Gin 
50 55 60 

Thr Ser Phe Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu 
65 7 0 75 ~ 80 

Phe Lys Val Arg Val Asn Gly Leu Val Ala Lys Ala Gin Lys Val Pro 
85 90 - 95 

Glu Glu Gly Trp lie Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn 
100 105 no 

Thr Arg Asp His Pro Gly Met He Gin Val Phe Leu Gly Gin Ser Glv 
115 120 125 

Gly Leu Asp Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser 
130 135 140 



Arg Glu Lys Arg Pro Gly Phe Gin His His Lys Lys Ala Gly Ala Met 
145 150 155 160 

Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Gly Ala Phe Leu 
165 170 175 

Leu Asn Leu Asp Cys Asp His Tyr He Asn Asn Ser Lys Ala Leu Arg 
180 185 190 

Glu Ala Met Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gin Val Cys 
195 200 205 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg Asn Asp Arg 
21° 215 220 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp He Asn Leu Arg Gly Leu 
225 230 235 240 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn 
245 250 255 

Arg Thr Ala Leu Tyr Gly Tyr Glu Pro Pro Leu Lys Pro Lys His Arg 
260 265 ' 270 

Lys Thr Gly He Leu Ser Ser Leu Cys Gly Gly Ser Arg Lys Lys Ser 
275 280 285 

Ser Lys Ser Ser Lys Lys Gly Ser Asp Lys Lys Lys Ser Gly Lys His 
290 295 300 

Val Asp Ser Thr Val Pro Val Phe Asn Leu Glu Asp He Glu Glu Gly 
305 310 315 320 

Val Glu Gly Ala Gly Phe Asp Asp Glu Lys Ser Leu Leu Met Ser Gin 
325 330 335 

Met Ser Leu Glu Lys Arg Phe Gly Gin Ser Ala Val Phe Val Ala Ser 
340 345 350 

Thr Leu Met Glu Asn Gly Gly Val Pro Gin Ser Ala Thr Pro Glu Thr 
355 360 365 
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Leu Leu Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp Lvs 
370 375 380 

Thr Asp Trp Gly Ser Glu He Gly Trp He Tyr Gly Ser Val Thr Glu 
385 390 395 40 0 

Asp He Leu Thr Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser He 
405 410 * " 415 

Tyr Cys Met Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn 
420 425 430 

Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser Val 
435 440 " 445 

Glu He Leu Phe Ser Arg His Cys Pro- He Trp Tyr Gly Tyr Ser Glv 
450 455 460 

Arg Leu Lys Trp Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr He Tyr 
465 470 475 480 

Pro Val Thr Ala He Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val 
485 490 495 

Cys Leu Leu Thr Asn Lys Phe He He Pro Gin He Ser Asn Leu Ala 
500 505 sio 

Ser He Trp Phe He Ser Leu Phe Leu Ser He Phe Ala Thr Gly He 
515 520 525 

Leu Lys Met Lys Trp Asn Gly Val Gly He Asp Gin Trp Trp Arq Asn 
530 535 540 

Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Val 
545 550 555 560 

Phe Gin Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr 
565 570 575 

Val Thr Ser Lys Ala Ser Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr 
580 585 590 

Met Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu He 
595 600 605 

He Asn Leu Val Gly Val Val Ala Gly He Ser Tyr Val He Asn Ser 
610 615 620 

Gly Tyr Gin Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe 
62 5 630 635 640 

Trp Val He He His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg 
645 650 655 

Gin Asn Arg Thr Pro Thr He Val Val Val Trp Ser He Leu Leu Ala 
660 665 670 



Ser lie Phe Ser Leu Leu Trp Val Arg He Asp Pro Phe Thr Thr Arg 
675 680 685 
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Val Thr Gly Pro Asp Val Glu Gin Cys Gly He Asn Cys 
690 695 700 

<210> 29 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 29 

Arg Pro Arg Leu He Ala Gly Ser His Asn Arg Asn Glu Phe Val Leu 
15 10 15 

He Asn Ala Asp Glu Asn Ala Arg He Arg Ser Val Gin Glu Leu Ser 
20 25 30 

Gly Gin Thr Cys Gin He Cys Arg Asp Glu He Glu Leu Thr Val Asp 
35 40 45 

Gly Glu Pro Phe Val Ala Cys Asn Glu Cys Ala Phe Pro Val Cys Arq 
50 55 60 

Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gin Ala Cys Pro Gin 
65 70 75 ' 80 

Cys Lys Thr Arg Phe Lys Arg Leu Lys Gly Ser Pro Arg Val Glu Gly 
85 90 " 95 

Asp Glu Glu Glu Asp Asp He Asp Asp Leu Asp Asn Glu Phe Glu Tyr 
100 105 no 

Gly Asn Asn Gly He Gly Phe Asp Gin Val Ser Glu Gly Met Ser He 
115 120 125 

Ser Arg Arg Asn Ser Gly Phe Pro Gin Ser Asp Leu Asp Ser Ala Pro 
130 135 140 

Pro Gly Ser Gin He Pro Leu Leu Thr Tyr Gly Asp Glu Asp Val Glu 
145 150 155 160 

He Ser Ser Asp Arg His Ala Leu He Val Pro Pro Ser Leu Gly Gly 
165 170 175 

His Gly Asn Arg Val His Pro Val Ser Leu Ser Asp Pro Thr Val Ala 
180 . 185 190 

Ala His Arg Arg Leu Met Val Pro Gin Lys Asp Leu Ala Val Tyr Gly 
195 200 205 

Tyr Gly Ser Val Ala Trp Lys Asp Arg Met Glu Glu Trp Lys Arq Lvs 
210 215 220 

Gin Asn Glu Lys Leu Gin Val Val Arg His Glu Gly Asp Pro Asp Phe 
225 230 235 * 240 

Glu Asp Gly Asp Asp Ala Asp Phe Pro Met Met Asp Glu Gly Arg Gin 
245 250 255 

Pro Leu Ser Met Lys He Pro He Lys Ser Ser Lys He Asn Pro Tyr 
260 265 270 
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Arg Met Leu He Val Leu Arg Leu Val He Leu Gly Leu Phe Phe His 
275 280 285 

Tyr Arg He Leu His Pro Val Lys Asp Ala Tyr Ala Leu Trp Leu He 
290 295 300 

Ser Val He Cys Glu He Trp Phe Ala Val Ser Trp Val Leu Asp Gin 
305 310 315 320 

Phe Pro Lys Trp Tyr Pro He Glu Arg Glu Thr Tyr Leu Asp Arg Leu 
325 330 335 

Ser Leu Arg Tyr Glu Lys Glu Gly Lys Pro Ser Gly Leu Ser Pro Val 
340 345 350 

Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro Leu He 
355 360 365 

Thr Ala Asn Thr Val Leu Ser He Leu Ala Val Asp Tyr Pro Val Asp 
370 375 380 

Lys Val Ala Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu Thr Phe 
385 390 395 400 

Glu Ala Leu Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp Val Pro Phe 
405 410 415 

Cys Lys Lys Tyr Cys He Glu Pro Arg Ala Pro Glu Trp Tyr Phe Cys 
420 425 430 

His Lys Met Asp Tyr Leu Lys Asn Lys Val His Pro Ala Phe Val Arg 
435 440 445 

Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys Val Lys He 
450 455 460 

Asn Ala Leu Val Ala Thr Ala Gin Lys Val Pro Glu Asp Gly Trp Thr 
465 470 475 * 480 

Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Ser Val Arg Asp His Pro 
485 490 495 

Gly Met He Gin Val Phe Leu Gly Ser Asp Gly Val Arg Asp Val Glu 
500 505 510 

Asn Asn Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro 
515 520 525 

Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ser Leu He Arg 
530 535 540 

Val Ser Gly Val Leu Ser Asn Ala Pro Tyr Leu Leu Asn Val Asp Cys 
545 550 555 560 

Asp His Tyr He Asn Asn Ser Lys Ala Leu Arg Glu Ala Met Cys Phe 
565 570 575 



Met Met Asp Pro Gin Ser Gly Lys Lys He Cys Tyr Val Gin Phe Pro 
580 585 590 
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Gin Arg Phe Asp Gly He Asp Arg His Asp Arg Tyr Ser Asn Arg Asn 
595 600 605 

Val Val Phe Phe Asp He Asn Met Lys Gly Leu Asp Gly Leu Gin Glv 
610 615 6 20 

Pro He Tyr Val Gly Thr Gly Cys Val Phe Arg Arg Gin Ala Leu Tvr 
625 630 ess 6 J Q 

Gly Phe Asp Ala Pro Lys Lys Lys Lys Gly Pro Arg Lys Thr Cys Asn 
645 650 ~ 655 

Cys Trp Pro Lys Trp Cys Leu Leu Cys Phe Gly Ser Arg Lys Asn Arq 
660 665 6 7o 

Lys Ala Lys Thr Val Ala Ala Asp Lys- Lys Lys Lys Asn Arg Glu Ala 
675 680 685 

Ser Lys Gin He His Ala Leu Glu Asn He Glu Glu Gly Arq Glv His 
690 695 700 

Lys Val Leu Asn Val Glu Gin Ser Thr Glu Ala Met Gin Met Lvs Leu 
705 710 715 720 

Gin Lys Lys Tyr Gly Gin Ser Pro Val Phe Val Ala Ser Ala Arg Leu 
725 730 735 

Glu Asn Gly Gly Met Ala Arg Asn Ala Ser Pro Ala Cys Leu Leu Lys 
740 745 750 

Glu Ala He Gin Val He Ser Arg Gly Tyr Glu Asp Lys Thr Glu Trp 
755 760 765 

Gly Lys Glu He Gly Trp He Tyr Gly Ser Val Thr Glu Asp He Leu 
770 775 780 

Thr Gly Ser Lys Met His Ser His Gly Trp Arg His Val Tyr Cys Thr 
785 790 795 800 

Pro Lys Leu Ala Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp 
805 810 815 

Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Phe 
820 825 830 

Leu Ser Arg His Cys Pro He Trp Tyr Gly Tyr Gly Gly Gly Leu Lys 
835 840 845 

Trp Leu Glu Arg Leu Ser Tyr He Asn Ser Val Val Tyr Pro Trp Thr 
850 855 860 



Ser Leu Pro Leu He Val Tyr Cys Ser Leu Pro Ala He Cys Leu Leu 
865 870 875 " 880 

Thr Gly Lys Phe lie Val Pro Glu He Ser Asn Tyr Ala Ser He Leu 
885 890 895 

Phe Met Ala Leu Phe Ser Ser He Ala He Thr Gly He Leu Glu Met 
900 905 910 
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Gin Trp Gly Lys Val Gly lie Asp Asp Trp Trp Arg Asn Glu Gin Phe 
915 920 925 

Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Leu Phe Gin Gly 
930 935 940 

Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe Thr Val Thr Ser 
945 950 955 960 

Lys Ala Ala Asp Asp Gly Glu Phe Ser Asp Leu Tyr Leu Phe Lys Trp 
965 970 975 

Thr Ser Leu Leu He Pro Pro Met Thr Leu Leu He lie Asn Val He 
980 985 990 

Gly Val He Val Gly Val Ser Asp Ala- He Ser Asn Gly Tyr Asp Ser 
995 1000 1005 

Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu Trp Val He He 
1010 1015 1020 

His Leu Tyr Pro Phe Leu Lys Gly Leu Leu Gly Lys Gin Asp Arg Met 
1025 1030 1035 • 1040 

Pro Thr He He Val Val Trp Ser He Leu Leu Ala Ser lie Leu Thr 
1045 1050 1055 

Leu Leu Trp Val Arg Val Asn Pro Phe Val Ala Lys Gly Gly Pro He 
1060 1065 ' 1070 

Leu Glu He Cys Gly Leu Asp Cys Leu 
1075 * 1080 
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2. Q Claims Nos.: 
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an extent that no meaningful International Search can be carried out, specifically: 



3. | j Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
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This International Searching Authority found multiple inventions in this international application, as follows: 
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1 — 1 searchable claims. 
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1 — 1 of any additional fee. 



3. | I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. | I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
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This International Searching Authority found multiple (groups*of) 
inventions in this international application, as follows: 

1. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding barley cellulose 
synthase, and corresponding polypeptide as represented by SEQ 
ID N0s:l and 2, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing andmethods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 



2. Claims: 1-6,13-18 all partially and 7-12 all completely 

Nucleic acid fragments encoding corn cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID N0s:3-10, 
fragments encoding amino acid sequence 90% identical 
thereto, transformed hosts containing and methods for 
obtaining said sequences, methods for altering expression and 
methods for evaluating inhibitors using said sequences. 



3. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding rice cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID N0s:ll 
and 12, fragments encoding amino acid sequence 90% identical 
thereto, transformed hosts containing and methods for 
obtaining said sequences, methods for altering expression and 
methods for evaluating inhibitors using said sequences. 



4. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding soybean cellulose 
synthase, and corresponding polypeptide as represented by SEQ 
ID N0s:13-18, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing and methods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 



5. Claims: 1-6,13-18 all partially 

Nucleic acid fragments encoding wheat cellulose synthase, and 
corresponding polypeptide as represented by SEQ ID 
NOs: 19-22, fragments encoding amino acid sequence 90% 
identical thereto, transformed hosts containing and methods 
for obtaining said sequences, methods for altering expression 
and methods for evaluating inhibitors using said sequences. 



6. Claim : 18 partially 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



Method for evaluating a compound for inhibitory activty on 
cellulose synthase comparing activity of cellulose synthase 
produced in a transformed host with and without the addition 
of the compound, not covered by any of the previous groups 
of claimed inventions 1-5. 
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